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TITLE OF THE INVENTION 

TRANS-SPLICING RIBOZYMES 



10 Field of the Invention 

The present invention is directed to novel 
ribozymes capable of trans-splicing reactions. 



BRIEF DESCRIPTION OF THE BACKGROUND ART 

I. <--T"'T T Tntrons 

RNA molecules with catalytic activity are called 
ribozymes or RNA enzymes (Cech, T.R. , Ann. J?ev. 
Biochem. 59:543-568 (1990). ' The Tetrahymena 
thermophila precursor rRNA contains an intron (a 
ribozyme) capable of catalyzing its own excision. 
This ribozyme is one of a class of structurally 
25 related Group I introns. 

The splicing activity of the modified T. 
tnermophila intron requires the presence of a 
guanosine cof actor and a divalent cation, either Mg 4+ 
or Mn ++ , and occurs via two sequential transesterif ica- 
tion reactions (Figure 1). First, a free guanosine 
is bound to the ribozyme and its 3' hydroxyl group is 
positioned to attack the phosphorus atom at the 5' 
splice site. The guanosine is covalently attached to 
the intron sequence and the 5' exon is released. 
Second, the phosphodiester bond located at the 3' 
splice site undergoes attack from the newly freed 3' 
hydroxyl group of the 5' exon, resulting in production 
of the ligated exon sequences. The excised intron 
subsequently undergoes a series of transesterif ication 
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reactions, involving its 3' hydroxy 1 group and 
internal sequences, resulting in the formation of 
shortened circular forms. 

These successive reactions are chemically similar 
and appear to occur at a single active site. The 
reactions of self-splicing are characterized by the 
formation of alternative RNA structures as differing 
UNA chains are each brought to form similar 
conformations around the highly conserved intron. 
Splicing requires the alignment of the intron-exon 
junctions across a complementary sequence termed the 
"internal guide sequence" or IGS. 

The first cleavage at the 5' splice site requires 
the formation of a base-paired helix (PI) between the 
ICS and sequences adjacent the splice site. The 
presence of a U:G "wobble" base-pair within this helix 
defines the phosphodiester bond that will be broken in 
the catalytic reaction of the ribozyme. After 
cleavage of this bond, a portion the PI helix is 
displaced and a new helix, F10, is formed due to 
complementarity between the IGS and sequences adjacent 
the 3' splice site. An invariant guanosine residue 
precedes the phosphodiester at the 3' splice site, 
similar to the portion of the PI sequence that it is 
displacing. Thus, ligation of the exons occurs in a 
reverse of the first cleavage reaction but where new 
exon sequences have been substituted for those of the 
intron. It may be noted that intron circularization 
reactions subsequent to exon ligation also involve 
base-pairing of 5' sequences across the IGS, and 
attack mediated by the 3' hydroxy 1 group of the 
intron's terminal guanine residue (Been, M.D. et al . , 
"Selection Of Circularization Sites In A Group I IVS 
RNA Requires Multiple Alignments Of An Internal 
Template-Like Sequence," Cell 50:951 (1987)). 
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II. Catalytic activities 

In order to better define the structural and 
catalytic properties of the Group I introns, exon 
sequences have been stripped from the "core" of the T. 
5 thermophila intron. Cech, T.R. et al., WO 88/04300, 

describes at least three catalytic activities 
possessed by the Tetrahymena intron ribozyme: (1) a 
dephosphorylating activity, capable of removing the 3' 
terminal phosphate of RNA in a sequence-specific 
10 manner, (2) an RNA polymerase activity (nucleotidyl 

transferase) , capable of catalyzing the conversion of 
oligoribonucleotides to polyribonucleotides, and (3) 
a sequence-specific endoribonuclease activity. 

Isolated ribozyine activities can interact with 
15 substrate RNAs in trans, and these interactions 

characterized. For example, vhen truncated forms of 
the intron are incubated with sequences corresponding 
to the 5' splice junction, the site undergoes 
guanos ine-dependent cleavage in mimicry of the first 
20 step in splicing. The substrate and 

endoribonucleolytic intron RNAs base-pair to form 
helix PI, and cleavage occurs after a U:G base-pair at 
the 4th-6th position. Phylogenetic comparisons and 
mutational analyses indicate that the nature of the 
25 sequences immediately adjacent the conserved uracil 

residue at the 5' splice site are unimportant for 
catalysis, provided the base-pairing of helix PI is 
maintained (Doudna, J. A. et al. f Proc. Natl. Acad. 
Sci. USA 86: 7402-7406 (1989)). 

The sequence requirements for 3' splice-site 
selection appear to lie mainly within the structure of 
the intron itself, including helix P9.0 and the 
following guanosine residue which delineates the 3' 
intron boundary. However, flanking sequences within 
35 tne 3 / eX on are required for the formation of helix 

P10 and efficient splicing, as shown by mutational 
analysis (Suh, E.R. et al . , Wol . Cell . Biol. 10:2960- 



30 
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2965 (1990)) . in addition, oligonucleotides have been 
iigated in trans, using a truncated for* of the 
intron, and -external- guide sequence and 
oligonucleotides which had been extended by .5 
guanosine residue. The substrate oligonucleotides 
corresponding to 3' exon sequences were aligned solely 
by the formation of PlO-like helices on an external 
template, prior to ligation (Doudna, J. A. et ml.. 
Nature 335:519-522 (1989)). 

The cleavage activity of ribozymes has been 
targeted to specific RNAs by engineering a discrete 
••hybridization- region into the ribozyme, such 
hybridization region being capable of specifically 
hybridizing with the desired RNA. For example, 
Gerlach, W.L. et ml., EP 321,201, constructed a 
ribozyme containing a sequence complementary to a 
target RNA. Increasing the length of this 
complementary sequence increased the affinity of this 
sequence for the target. However, the hybridizing and 
cleavage regions of this ribozyme were integral parts 
of each other. Upon hybridizing to the target RNA 
through the complementary regions, the catalytic 
region of the ribozyme cleaved the target. It was 
suggested that the ribozyme would be useful for the 
inactivation or cleavage of target RNA in vivo, such 
as for the treatment of human diseases characterized 
by the production of a foreign host's RNA. However, 
ribozyme-directed trans-splicing, (as opposed to 
trans-cleavage) was not described or suggested. 

The endoribonuclease activities (the cleavage 
activities) of various naturally-occurring ribozymes 
have been extensively studied. Analysis of the 
structure and sequence of these ribozymes has 
indicated that certain nucleotides around the cleavage 
site are highly conserved but flanking sequences are 
not so conserved. This information has lead to the 
design of novel endoribonuclease activities not found 
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in nature. For example, Cech and others have 
constructed novel ribozymes with altered substrate 
sequence specificity (Cech, T.R. et al., WO 88/04300; 
Koizumi, M. et al., FEBS Lett. 226:228-230 (1988); 
Koizumi, M. et al., FEBS Lett. 239:285-288 (1988); 
Haseloff, J. et al., Nature 334:585-591 (1987); and 
Heus, H.A. et al., Wucl. Acids Res. 28:1103-1108 
(1990)). From early studies of the self -cleaving 
plant viroids and satellite RNAs (Buzayan, J.M. et 
al., Proc. Natl. Acad. Sci. USA 83:8859-8862 (1986), 
guidelines for the design of ribozymes that are 
capable of cleaving other RNA molecules in trans in a 
highly sequence specific have been developed 
(Haseloff, J. et al., Nature 334:585-591 (1988)). 
However, these constructs were unable to catalyze 
efficient, targeted trans-splicing reactions. 

The joining of exons contained on separate RNAs, 
that is, trans-splicing, occurs in nature for both 
snRNP-mediated and self-catalyzed group I and group II 
introns. In trypanosome and Caenorhabditis elegans 
mRNAs, common 5' leader sequences are transcribed from 
separate genes and spliced to the 3' portions of the 
mRNAs (Agabian, N. , Cell 61:1157-1160 (1990); Hirsh, 
D. et a2., Mol. Biol. Rep. 24:115 (1990). These small 
25 "spliced leader" RNAs (slRNAs) consist of the 5' exon 

fused to sequences that can functionally substitute 
for Ul snRNA in mammalian snRNP-splicing extracts. 

Also, both the group I and group II self-splicing 
introns are capable of exon ligation in trans in 
30 artificial systems (Been, M.D. et al., Cell 47:207-216 

(1986); Galloway-Salvo, J.L. et al., J. Mol. Biol. 
221:537-549 (1990); Jacquier, A. et al . , Science 
234:1099-1194 (1986); and Jarrell, K.A. et al., Mol. 
Cell Biol. 8:2361-2366 (1988)). Trans-splicing occurs 
35 i n v ivo for group II introns in split genes of 

chloroplasts (Kohchi, T. et al., Nucl . Acids Res. 
26:10025-10036 (1988)) , and has been shown for a group 
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I intron in an artificially split gene in Escherichia 
coli (Galloway-Salvo, J.L. et al., J- Hoi. Biol. 
221:537-549 (1990)). In the latter case, a 
bacteriophage T4 thymidylate synthase gene (td) 
containing a group I intron was divided at the loop 
connecting the intron helix P6a. Transcripts of the td 
gene segments were shown to undergo trans-splicing in 
vitro, and to rescue dysfunctional E. coli host cells. 
Known base-pairings (P3, P6 and P6a) and possible 
tertiary interactions between the intron segments, 
allowed correct assembly and processing of the gene 
halves . 

In vitro, the Tetrahymena ribozyme is capable of 
catalyzing the trans-splicing of single-stranded model 
oligoribonucleotide substrates. Four components were 
necessary: ribozyme, 3' single-stranded RNA, 5' exon 
and GTP. A shortened form of the Tetrahymena 

ribozyme (L-21 Seal IVS RNA) , starting at the internal 
guide sequence and terminating at U 409 has been used in 
such a reaction (Flanegan, J.B. et al., J. Cell. 
Biochem. (Supp.)12 part D:28 (1988)) , Attack by GTP at 
the 5' splice site released the 5' exon which was then 
ligated by the ribozyme to the 3' exon in a 
transesterification reaction at the 3' splice site. 

The in vivo use of ribozymes as an alternative to 
the use of antisense RNA for the targeting and 
destruction of specific RNAs has been proposed 
(Gerlach, W.L. et al., EP32l,201; Cotten, M. , Trends 
Biotechnol. fi:174-178 (1990); Cotten, M. et al., EMBO 
j. 8:3861-3866 (1989); Sarver, N. et al., Science 
247:1222-1225 (1990)). For example, expression of a 
ribozyme with catalytic endonucleolytic activity 
towards an RNA expressed during HIV-1 infection has 
been suggested as a potential therapy against human 
immunodeficiency virus type 1 (HIV-1) infection 
(Sarver, N. et al., Science 247:1222-1225 (1990); 
Cooper, M. , CDC AIDS Weekly, April 3, 1989, page 2; 
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Rossi, J.J., Abstract of Grant No. 1R01AI29329 in 
Dialog's Federal Research in Progress File 265). 
However, such attempts have not yet been successful* 

In a study designed to investigate the potential 
use of ribozymes as therapeutic agents in the 
treatment of human immunodeficiency virus type 1 (HIV- 
1) infection, ribozymes of the hammerhead motif 
(Hutchins, C.J. et al., Nucl. Acids Res. 14:3627 
(1986); Keese, P. et al . , in Viroids and Viroid-Like 
Pathogens, J.S. Semancik, ed., CRC Press, Boca Raton, 
FL, 1987, pp. 1-47) were targeted to the HIV-1 gag 
transcripts. Expression of the gag-targeted ribozyme 
in human cell cultures resulted in a decrease (but not 
a complete disappearance of) the level of HIV-1 gag 
RNA and in antigen p24 levels (Sarver, N. et al., 
Science 247:1222-1225 (1990)). Thus, the medical 
effectiveness of Sarver 's ribozyme was limited by its 
low efficiency since any of the pathogen's RNA that 
escapes remains a problem for the host. 

Another problem with in vivo ribozyme 
applications is that a high ribozyme to substrate 
ratio is required for ribozyme inhibitory function in 
nuclear extracts and it has been difficult to achieve 
such ratios. Cotton et al . achieved a high 

ribozyme to substrate ration by microinjection of an 
expression cassette containing a ribozyme-producing 
gene operably linked to a strong tRNA promoter (a 
polymerase III promoter) in frog oocytes, together 
with substrate RNA that contains the cleavage sequence 
for the ribozyme (Cotton, M. et al., EMBO J. 8:3861- 
3866 (1989). However, microinjection is not an 
appropriate method of delivery in multicellular 
organisms * 

The in vivo activity of ribozymes designed 
against mRNA coding for Escherichia coli P~ 
galactosidase has been reported (Chuat, J.-C. et al., 
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Biocne*. Biopnys. Res. — • 1*2:1025-1029 (1989)). 
However, this activity was only observed when the 
ribozyme and target were transfected into bacterial 
cells on the same molecule. Ribozyme activity was 
inefficient when targeted against an mRNA transcribed 
from a bacterial F episome that possessed the target 
part of the 0-galactosidase gene. 

Thus, current technological applications of 
ribozyme activities are limited to those which propose 
to utilize a ribozyme's cleavage activity to destroy 
the activity of a target RNA. Unfortunately, such 
applications often require complete destruction of all 
target RNA molecules, and/or relatively high 
ribozyme: substrate ratios to ensure effectiveness and 
this has been difficult to achieve. Most importantly, 
the modified ribozymes of the art are not capable of 
efficient, directed trans-splicing. 

Accordingly, a need exists for the development of 
highly efficient ribozymes and ribozyme expression 
systems. Especially, the art does not describe an 
effective means in which to destroy an existing RNA 
sequence or to alter the coding sequence of an 
existing RNA by the trans-splicing of a new RNA 
sequence into a host's RNA. 

etnm&P Y QF THF TWVENTION 

Recognizing the potential for the design of novel 
ribozymes, and cognizant of the need for highly 
efficient methods to alter the genetic characteristics 
of higher eukaryotes in vivo, the inventors have 
investigated the use of ribozymes to alter the genetic 
information of native RNA' s in vivo. These efforts 
have culminated in the development of highly effective 
trans-splicing ribozymes, and guidelines for the 
engineering thereof. 
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According to the invention, there is first 
provided an RNA or DNA molecule , such molecule 
encoding a trans-splicing ribozyme, such ribozyme 
being capable of efficiently splicing a new 3' exon 
sequence into any chosen target RNA sequence in a 
highly precise manner, in vitro or in vivo, and such 
molecule being novel in the ability to accommodate any 
chosen target RNA or 3' exon sequences, and in the 
addition of a complementary sequence which enhances 
the specificity of such ribozyme. 

According to the invention, there is also 
provided an RNA or DNA molecule, such molecule 
encoding a ribozyme, the sequence for such ribozyme 
being a fusion RNA, such fusion RNA providing a first 
RNA sequence that is sufficient for targeting such 
ribozyme to hybridize to a target RNA, and further a 
second RNA sequence, such second RNA sequence capable 
of being transposed into the target RNA, and such 
second RNA sequence encoding an RNA sequence foreign 
to the targeted RNA sequence. 

According to the invention , there is also 
provided an RNA or DNA molecule, such molecule 
encoding a conformational ly disrupted ribozyme of the 
invention, a pro-ribozyme, such pro-ribozyme being 
substrate-activated, that is, such pro-ribozyme 
possessing neglible or no self-cleavage or trans- 
splicing activity, until being reactived by specific 
interaction with target RNA. 

According to the invention, there is further 
provided an RNA or DNA molecule containing a ribozyme 
or pro-ribozyme expression cassette, such cassette 
being capable of being stably maintained in a host, or 
inserted into the genome of a host, and such cassette 
providing the sequence of a promoter capable of 
functioning in such host, operably linked to the 
sequence of a ribozyme or pro-ribozyme of the 
invention. 
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According to the invention, there is further 
provided a method for in-vitro trans-splicing, such 
method comprising the steps of (1) providing a 
ribozyme or pro-ribozyme of the invention and an 
appropriate substrate for such ribozyme or pro- 
ribozyme in vitro, (2) further providing in vitro 
reaction conditions that promote the desired catalytic 
activity of such ribozyme or pro-ribozyme; and (3) 
allowing such ribozyme or pro-ribozyme to react with 
such substrate under such conditions. 

According to the invention, there is further 
provided a method for in vivo trans-splicing, such 
method comprising the steps of (1) providing an RNA or 
DNA molecule of the invention to a host cell, (2) 
expressing the ribozyme or pro-ribozyme encoded by 
such molecule in such host cell, (3) expressing a 
substrate of such ribozyme or pro-ribozyme in such 
host cell, and (4) allowing such ribozyme or pro- 
ribozyme to react with such substrate in such host 
20 cell. 

According to the invention, there is further 
provided a method for inactivating the activity of a 
target RNA, such method comprising (1) providing a 
ribozyme or pro-ribozyme of the invention, such 
ribozyme or pro-ribozyme being catalytically active 
against such target RNA, (2) providing such target 
RNA, and (3) providing conditions that allow such 
ribozyme or pro-ribozyme to express its catalytic 
activity towards such target RNA. 

According to the invention, there is further 
provided a method for providing a desired genetic 
sequence to a host cell in vivo, such method 
comprising (1) providing a ribozyme or pro-ribozyme of 
the invention to a desired host cell, such ribozyme or 
pro-ribozyme being catalytically active against a 
target RNA in such host cell, (2) providing such 
ribozyme or pro-ribozyme encoding such desired genetic 
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sequence, and (3) providing conditions that allow such 
ribozyme or pro-ribozyme to trans-splice such desired 
genetic sequence into the sequence of the target RNA. 

According to the invention, there is further 
provided a method for engineering male or female 
sterility in agronomical ly important plant species, 
such method comprising providing a ribozyme or pro- 
ribozyme of the invention to a desired cell of such 
species, such ribozyme or pro-ribozyme being targeted 
to any RNA expressed in a cell necessary for 
fertility, such ribozyme or pro-ribozyme providing a 
sequence encoding a toxic product to the trans-spliced 
RNA. 

According to the invention, there is further 
15 provided a method of modifying the genetics of crop 

plants, such method comprising providing a germ cell 
of such crop plant with a ribozyme of the invention, 
such ribozyme encoding a sequence capable of 
conferring such desired genetic modification in such 

20 plant. 

According to the invention, there is further 
provided a method of immunizing plants against plant 
pathogens, such method comprising the construction of 
transgenic plants capable of expressing a plant 
25 pathogen-specific fusion ribozyme of the invention, 

and such ribozyme being capable of destroying or 
inhibiting the pathogen. 

According to the invention, there is further 
provided a transformed, pathogen-resistant 
microorganism, such microorganism being resistant to 
a desired pathogen, such microorganism being 
transformed with a ribozyme of the invention and such 
ribozyme providing a catalytic activity that targets 
a nucleic acid molecule expressed by such pathogen. 

According to the invention, there is further 
provided a viral pathogen capable of delivering a 
desired ribozyme activity to a desired host, such 
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ribozyme activity being delivered by a ribozyme of the 
invention. 

pEfrP TPTTON <?F T" T FTSURES 

Figure 1 is a diagram of the mechanism of 
ribozyme splicing of the group I intron. 

Figure 2 is a diagram of structure of the (A) 
Tetrahymena thermophila rRNA intron; (B) Target mRNA 
and trans-splicing ribozyme of the invention. 

Figure 3 (A) is a diagram of the design of a CAT- 
LacZ o-peptide trans-splicing ribozyme; 3(B) is the 
complete DNA sequence of the CAT-LacZ ribozyme. 

Figure 4 presents the sequences of cucumber 
15 mosaic virus (CMV) RNA 4 trans-splicing ribozymes. A: 

virus RNA target sequences; B: Oligonucleotide target 
sequences; C: CMV RNA4 - diphtheria toxin A-chain 
trans-splicing ribozymes. 

Figure 5 is a comparison of cucumber mosaic virus 

20 3/4 sequences. 

Figure 6 presents a partial sequence of wild-type 
DTA and DTA 3' and exon mutants. Fig. 6(c) is the 
complete coding sequence of a Gal 4 -DTA ribozyme with 
the isoleucine substitution. 

25 Figure 7 presents the rationale for "pro- 

ribozyroe" design. Arrows show sites of ribozyme 
cleavage, "antisense" regions are shown in black, 
catalytic domains are shown with radial shading, and 
3' "exon" sequences are shown with light shading. In 

30 the absence of the target mRNA, trans-splicing 

ribozymes may transiently base-pair, and react with 
heterologous sequences (including their own). In 
addition, scission at the «3' exon" junction will 
occur. Inactive "pro-ribozymes" are constructed to 

35 contain extra self-complementary sequences which cause 

the catalytic center of the ribozyme to be mis-folded. 
Active ribozymes are only formed after base-pairing 
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with the intended target mRNA - and consequent 
displacement of the interfering secondary structure. 

Figure 8 shows the sequence and predicted 
secondary structure of the CAT-LacZ trans-splicing 
ribozyme. Ribozyme "core" sequences are shaded (after 
Cech, Gene 73:259-271 (1988)). Helices P8 are shown 
for the unmodified ribozyme and pro-ribozymes 1 and 2, 
with 13 and 18 nucleotides, respectively, of sequence 
complementary to the "antisense" region (highlighted) . 

Figure 9 shows (1) active CAT-LacZ ribozyme shown 
schematically, with "antisense" , ribozyme domain with 
helix P8 and 3' "exon" sequences; (2) (a) Inactive 
CAT-LacZ pro-ribozyme 2 shown with base-pairing 
between sequences in the modified helix P8 and the 
"anti-sense" region; and (b) the active pro-ribozyme, 
after base-pairing with the CAT mRNA, displacement of 
the helix P8 - "antisense" pairing, and re-formation 
of helix P8, 

Figure 10 shows stability of CAT-LacZ pro- 
ribozyme transcripts. Plasmids containing the CAT- 
LacZ ribozyme and pro-ribozyme sequences were cleaved 
with £coRI and transcribed using T7 or SP6 RNA 
polymerase and [32-P]UTP. Radiolabeled transcripts 
were fractionated by 5% polyacrylamide gel 
electrophoresis in 7M urea and 25% formamide, and 
autoradiographed. The ribozyme transcripts underwent 
extensive hydrolysis, primarily at the "3' exon" 
junction. The pro-ribozyme forms were markedly less 
reactive. 

Figure 11 shows endoribonuclease activity of CAT- 
LacZ pro-ribozymes. Plasmids containing CAT-LacZ 
ribozyme and pro-ribozyme sequences were cleaved, with 
Seal, and transcribed with T7 or SP6 RNA polymerase. 
Transcripts were incubated for 30' at 37°C, 45°C and 
50°C in 40 mM Tris-HCl pH 7,5, 6 mM MgCl 2< 2 mM 
spermidine, 10 mM NaCl, 2 mM GTP with radiolabeled CAT 
RNA, transcribed using T7 RNA polymerase from plasmid 
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cut with Puvll. Products were fractionated by 5% 
polyacrylamide gel electrophoresis in 7M urea and 25% 
formamide, and autoradiographed. *NA ^edxated 
cleavage of the 173 nt (nucleotides) CAT RNA produces 
5' and 3' fragments of 76 nt and 97 nt, respectively. 

Figure 12 shows the -wild-type" and modified 
helices P8 used for pro-ribozyme design with possible 
base-pairs indicated in schematic form. Those bases 
which are complementary to the "anti-sense- portion of 
the corresponding pro-ribozyme , are shown in bold 
type The number of complementary bases is listed 
next to each helix. The helices are ordered by the 
stability of the corresponding pro-ribozyme 
transcripts, as measured by the degree of «3< exon" 
hydrolysis during in vitro transcription. 

Figure 13 shows the stability of GAL4-DTA pro- 
ribozymes. Plasmids containing ribozyme and pro- 
ribozyme sequences were linearized with Xhol and 
transcribed using T7 RNA polymerase. Transcripts were 
incubated for 60' at 50'C n 40 mM Tris-HCl P H 7.5, 6 
*K MgCl 2 , 2 mM spermidine, 10 mM H.C1, 1 mM GTP, were 
fractionated by 5% polyacrylamide gel electrophoresis 
in 7M urea and 25% formamide, and autoradiographed. 
Ribozyme transcripts are extensively hydrolysed under 
these conditions, while pro-ribozyme 1 is less so and 
pro-ribozyme 2 is stable. 

I. paf^nit-ions 

in the description that follows, a number of 
terms used in recombinant DNA (rDNA) technology are 
extensively utilized. In order to provide a clear and 
consistent understanding of the specification and 
claims, including the scope to be given such terms, 
the following definitions are provided. 
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ttibozvme . An RNA molecule that inherently 
possesses catalytic activity. 

Trf n S - SP lice . A form of genetic manipulation 
whereby a nucleic acid sequence of a first 
polynucleotide is colinearly linked to or inserted 
colinearly into the sequence of a second 
polynucleotide, in a manner that retains the 3'- 5' 
phosphodiester linkage between such polynucleotides. 
By "directed" trans-splicing or "substrate-specific" 
trans-splicing is meant a trans-splicing reaction that 
requires a specific specie of RNA as a substrate for 
the trans-splicing reaction (that is, a specific 
specie of RNA in which to splice the transposed 
sequence). Directed trans-splicing may target more 
than one RNA specie if the ribozyrne is designed to be 
directed against a target sequence present in a 

related set of RNAs. 

Target RNA . An RNA molecule that is a substrate 
for the catalytic activity of a ribozyrne of the 

20 invention. 

fv pi-uBBion cassette . A genetic sequence that 
provides sequences necessary for the expression of a 
ribozyrne of the invention. 

fftablv . By "stably" inserting a sequence into a 
genome is intended insertion in a manner thtt results 
in inheritance of such sequence in copies of such 
genome . 

pr ^.yo* linkage. An "operable linkage" is a 
linkage in which a sequence is connected to another 
sequence (or sequences) in such a way as to be capable 
of altering the functioning of the sequence (or 
sequences). For example, by operably linking a 
ribozyrne encoding sequence to a promoter, expression 
of the ribozyrne encoding sequence is placed under the 
influence or control of that promoter. Two nucleic 
acid sequences (such as a ribozyrne encoding sequence 
and a promoter region sequence at the 5' end of the 
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encoding sequence) are' said to be operably linked xf 
induction of promoter function results xn the 
transcription of the ribozyme encoding sequence and xf 
the nature of the linkage between the two sequences 
does not (1) result in the introduction of a frame- 
shift mutation, (2) interfere with the ability of the 
expression regulatory sequences to direct the expres- 
sion of the ribozyme. Thus, a promoter region would 
be operably linked to a nucleic acid sequence xf the 
promoter were capable of effecting the synthesxs of 
that nucleic acid sequence. 

IX . rr-T«r~ r<™ of thP thP T " veT ^ on 

The trans-splicing ribozymes, pro-ribozymes and 
methods of the invention provide, for the first time, 
a ribozyme capable of directed trans-splicing into any 
RNA sequence, and especially into mature (non-intron- 
containing) mSNA. The trans-splicing ribozyme as 
described herein, with its extended complementarity to 
the target, greatly differs from T. thermophlla 
derived endoribonuclease activities described xn the 
art The additional complementarity of the ribozymes 
of the invention confers increased affinity and 
specificity for the target and the complementarity is 
not an integral part of the catalytic activity. In 
addition, cleavage occurs efficiently and precisely in 
the absence of denaturants and at high concentrations 

of Mg ++ . , 

The guidelines described herein for the design of 

trans-splicing ribozymes are conservative, based on 
the well characterized properties of group I self- 
splicing introns and are meant to provide a general 
scheme for the design of any directed trans-splicing 
ribozyme. ' Accordingly, the guidelines presented 
herein are not limited to the group I intron of the T. 
themophila pre-mRNA and may be used by one of skill 
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in the art to design a ribozyme of the invention with 
other group I introns using such guidelines and 
knowledge in the art. The native r. thermophila 
ribozyme (the intron sequence) is located from base 53 
to base 465 in the sequence below of the T. 
thermophila extrachromosomal rDNA: 

TGACGCAATT CAACCAAGCG CGGGTAAACG GCGGGAGTAA CTATGACTCT 
CTAAATAGCA ATATTTACCT TTGGAGGGAA AAGTTATCAG GCATGCACCT 
CCTAGCTAGT CTTTAAACCA ATAGATTGCA TCGGTTTAAA AGGCAAGACC 
GTCAAATTGC GGGAAAGGGG TCAACAGCCG TTCAGTACCA AGTCTCAGGG 
15 GAAACTTTGA CATGGCCTTG CAAAGGGTAT GGTAATAAGC TGACGGACAT 

GGTCCTAACC ACGCAGCCAA GTCCTAAGTC AACAGATCTT CTGTTGATAT 
GGATGCAGTT CACAGACTAA ATGTCGGTCG GGGAAGATGT ATTCTTCTCA 
20 TAAGATATAG TCGGACCTCT CCTTAATGGG AGGTAGCGGA TGAATGGATG 

CAACACTGGA GCCGCTGGGA ACTAATTTGT ATGCGAAAGT ATATTGATTA 
25 GTTTTGGAGT ACTCGTAAGG TAGCCAAATG CCTCGTCATC TAATTAGTGA 

CGCGCATGAA TGGATTA [SEQ ID NO.l] 

(Kan, N.C. et al., mcl . Acids Res. 10:2809-2822 
30 (1982)). 

As described herein, the directed trans-splicing 
ribozymes of the invention are engineered using the 
catalytic core of this intron. The intron, and its 
35 catalytic core can be isolated by methods known in the 

art. The catalytic core of the intron, that is, the 
truncated intron, differs form the full-length intron 
only in that it is truncated at the Seal site, thus 
removing the last five nucleotides of the intron. The 
truncated intron RNA may be prepared by techniques 
known in the art or may be purchased commercially in 
kit form from commercial sources such as, for example, 
product #72000 from US Biochemical, Cleveland, OH 
Tet 1.0 Kit). This US Biochemical kit 
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provides ribozyme and the protocol for the use of the 
ribozyme. Transcribed Tet.l cDNA may be used as the 
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substrate for polymerase chain reaction (PGR) 
mutagenesis as described below, to produce a synthetic 
trans-splicing enzyme. 

Substrate specificity of the ribozyme of the 
invention, that is, the ability of the ribozyme to 
-target" a specific RNA as a substrate, is conferred 
by fusing complementary sequences specific to the 
target (substrate) RNA to the 5' terminus of the 
ribozyme. 

Directed trans-splicing specificity of the 
ribozyme of the invention, that is, specificity m 
trans-splicing a desired foreign sequence of 
with the sequence of a target RNA, is conferred by 
providing a new 3' exon at the 3' terminus of the 
ribozyme. Details of the design are further provided 
below. 

To alter the structural and catalytic properties 
of the Group I introns, exon sequences replace the 
flanking sequences of such introns so that only the 
catalytic core of the intron, the ribozyme, remains. 
The resulting modified ribozyme can interact with 
substrate RNAs in trans. When truncated forms of the 
intron (i.e., the catalytic "core," i.e. truncated at 
the Seal site, removing the last five nucleotides of 
the intron) are incubated with sequences corresponding 
to the 5' splice junction of the native ribozyme, the 
site undergoes guanosine-dependent cleavage in mimicry 
of the first step in splicing. 

Engineering of the ribozymes of the invention 
requires consideration of the four guidelines that 

fOUO First, a splice site must be chosen within the 
target RNa! In the final trans -splicing complex, only 
the 5' portion of the PI duplex is contributed by the 
target RNA. Only a single conserved residue, uracil, 
is required immediately 5' of the intended splice 
site. This is the sole sequence requirement m the 
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target RNA. There is no inate structure required of 
the target RNA. Mature mRNA may be targeted and the 
trans-splicing reaction performed in the cell's 
cytoplasm rather than in the nucleus (against pre- 
mRNA) . This obviates the need for high concentrations 
of ribozyme in a cell's nucleus. 

Second, having chosen a particular target 
sequence, compensating sequence changes must be added 
to the 5' section of the ribozyme in order to allow 
the formation of a suitable helix PI between the 
target and ribozyme RNAs . It is highly desired is that 
the helix PI should contain a U:G base-pair at the 
intended 5' splice site, and should be positioned at 
the 4th # 5th (preferred) or 6th position from the base 
of the helix (Doudna, J. A., et al ♦ , "RNA Structure, 
Not Sequence Determines The 5' Splice-Site Specificity 
of a Group I Intron," Proc. Natl. Acad. Sci . USA 
85:7402-7406 (1989), incorporated herein by 
reference). For the native r. thermophila intron, PI 
extends for an additional 3 base pairs past the 
intended 5' splice site, and, in a preferred 
embodiment, this is maintained in the trans-splicing 
ribozyme of the invention. For trans-splicing to be 
efficient, the substrate and endoribonucleolytic 
intron RNAs must base-pair to form helix PI, with a 
resulting wobble U:G base-pair. Cleavage of the target 
RNA occurs at the phosphodiester bond immediately 3' 
to (after the) U:G base-pair. Phylogenetic 
comparisons and mutational analyses indicate that the 
nature of the sequences immediately adjacent the 
conserved uracil residue at the 5' splice site are 
unimportant for catalysis, provided the base-pairing 
of helix Pi is maintained. 

Third, the exon sequences flanking the 3' splice 
site must be chosen, and adjustments made in the 5' 
section of the ribozyme, if necessary, to allow the 
formation of a stable P10 helix. While the P10 helix 
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M y be dispense* with « «ce.s.ry. it. presence 
enhances splicing and preferred ~ 
ribotyme of the invention retain the P10 helix (Suh. 

„ erntl -id. Sequence Xncreases V Splice Site 
specificity in the Ktrahyaena self-Splicing rRNA 
iTtron - L. cell. Biol. *>: 2 9«- 2 9« C»..)>. *• 
wlcs PI end PiO overlap .Ion, the T. thermoplul. 
intronXGS. and the *nd and 3rd residues blowing 
bo" «e 5' and V splice .ites are complementary to 
t . residues in the ICS (Figure « • «-~ 

B ay M some advantage in following this, ««* «*■"! 
group I introns do not share this constant so the 
Lice of 3- «on sequences may be deterged 
airily by experimental considerations. Such 
co^ "tions reflect the wide flexibility in choice 
of space sites. For example, if it is desired to ,oin 
two sequences at a given point, the sequence at such 
point cannot be mutated or otherwise altered by the 
trans-splicing event. Either PI or P10 can be made 
sorter if the overlapping sequences don't otherwise 
accomodate the desired splice site. 

The sequence requirement, for 3' splice-site 
.election appear to lie main ly within ^ 
the intron (the ribozyme) itself, including tali* F9-0 
end the adjoining 3' guanosine rescue which 
Te in.«.s the 3' intron boundary. P9.0 is wholly 
contained within the intron sequences and helps def ine 
the adjacent 3- splice site. For the trans-splicing 
lesion the *9.0 helix and the rest of the functional 
^Tl.m?n« within the intron are not aitered The 
Ttructural characteristics of the P9.0 helix are Known 
(Michel. T. etal., "The Guanosine Binding Site o the 
retrains Ribozyme." Nature 342:391-395 (1989)) • 
However. flanKing sequences within the 3' exon are 
reared for the formation of helix ,10 and efficient 
splicing, as shown by mutational analysis. 
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Fourth, a region of complementary sequence is 
placed at the 5' terminus of the trans-splicing 
ribozyme in order to increase its affinity and 
specificity for the target RNA. The sequences 
involved in complementarity do not immediately abut 
sequences involved in PI helix formation but are 
separated by, for example, five nucleotides also 
involved in P10 formation. As shown herein, an 
arbitrary length of around 40 residues has been used. 
Other lengths may be used provided they are not 
detrimental to the desired effect. 

For example, starting with the T. thernophUa 
self-splicing intron (diagrammed below): 

l 

15 t pl | U A G C A A 

5 ' CUCUCUAAA 0 

I' ll* * 

CCCACGUUUCCAUUU 

20 I . . .GUAACGUA. . . 3' 
ribozyme core | plQ 

2 

(The "1" and "2" in the above diagram (and in other 
ribozyme diagrams throughout the application) note 
the first and second splice sites, respectively.) 

(1) a "5 /M site is chosen adjacent to a uracil 
residue within a chosen target RNA; 

(2) sequences complementary to the chosen RNA 
are fused to the 5' portion of the self-splicing Group 
1 intron. Base-pairing between ribozyme and target 
RNA allow formation the of the helix PI; 

(3) the chosen "3' exon" sequences .are fused to 
the 3' portion of the ribozyme, maintaining the 

35 conserved helix P10; and 

(4 ) to increase affinity for the target RNA, if 
desired, a section of extended sequence 
complementarity is fused to the 5' portion of the 
ribozyme to allow the formation of 30-40 base-pairs. 

The alignment of the resulting trans-splicing 
ribozyme with its target RNA may be diagrammed - 
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shewn 1— Ufly «• "* Se9Ue " C i : 

nf the top line. The ribozyme sequence i. 
represents th. top 

Srr-T^ - Hnes where, the h^t U. 

, lit the 5' and 3' ends, and PI ana 

of the nucleotides at "cne s 

P10, of the ribozyme may be seen. 
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According to the invention, trans-spHcing 
ribozymes can be designed that will trans-splice 
essentially any RNA sequence onto any RNA target. It 
is not necessary that the target contain an mtron 
sequence or that the ribozyme be an intron in the 
target sequence. For example, a strategy for such 
design may include (1) the identification of the 
desired target RNA (2) cloning and/or sequencing of 
the desired target RNA or portion thereof (3) 
selection of a desired coding sequence to trans-spUce 
into the target RNA, (4) the construction of a 
ribozyme of the invention capable of hybridizing to 
such target using the guidelines herein and (5) 
confirmation that the ribozyme of the invention will 
utilize the target as a substrate for the specific 
trans-splicing reaction that is desired and (6) the 
insertion of the ribozyme into the desired host cell. 

Choice of a target RNA will reflect the desired 
purpose of the trans -splicing reaction. If the 
purpose of the reaction is to inactivate a specific 
RNA then such RNA must be trans-spliced at a position 
that destroys all functional peptide domains encoded 
by such RNA and at a position that does not result in 
continued expression of the undesired genetic 
sequences. If Bore than one allele of the gene 
encoding such RNA exists, the ribozyme should 
preferablv be designed to inactivate the target RNA at 
a site common to all expressed alleles. 
Alternatively, more than one ribozyme may be provided 
to the cell, each designed to inactivate a specific 
allelic form of the target RNA. 

When only inactivation of the target RNA as 
desired, and not the expression of a new, desired RNA 
sequence, it is not necessary that the foreign RNA 
donated by the ribozyme provide a sequence capable of 
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being translated by the host cell, and a sequence 
containing translation^ stop codons may be used as a 
truncated intron, for example, the intron ribozyme 
truncated at the Seal site. 
5 if the purpose of the trajis-splicing reaction is 

to provide a genetic trait to a host cell, then the 
choice of target RNA will reflect the desired 
expression pattern of the genetic trait. If it is 
desired that the genetic trait be continuously 
10 expressed by the host, then the target RNA should also 

to be continuously expressed. If it is desired that 
the genetic trait be selectively expressed only under 
a desired growth, hormonal, or environmental 
condition, then the target RNA should also be 
selectively expressed under such conditions. 

It is not necessary that expression of the 
ribozyme itself be selectively limited to a desired 
growth, hormonal, or environmental condition if the 
substrate for such ribozyme is not otherwise present 
in the host as the ribozyme itself is not translated 
by the host. Thus, sequences encoded by the RNA 
donated by the ribozyme of the invention are not 
translated until the trans-splicing event occurs and 
such event may be controlled by the expression of the 
2 5 ribozyme substrate in the host. 

If desired, expression of the ribozyme may be 
engineered to occur in response to the same factors 
that induce expression of a regulated target, or, 
expression of the ribozyme may be engineered to 
provide an additional level of regulation so as to 
limit the occurrence of the trans-splicing event to 
those conditions under which both the ribozyme and 
target are selectively induced in the cell, but by 
different factors, the combination of those factors 
being the undesired event. Such regulation would 
allow the host cell to express the ribozyme 's target 
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„nder those conditions in which the ribozyme itself 
was not co-expressed. ^ 

The sequence of the noo*i 
hybridize, to the target KHA is determined by the 
sequence of the target FMA. The sequence cf the 
17Z F*A is determined after doning sequences 
encoding such FNA or after sequencing a peptide 
."cooed by such target and deducing an BHA sequence 
th" would encode such a peptide, Cloning techniques 
Town in the art may be used for the cloning of a 
sequence encoding a target UNA. 

The selection of a desired sequence to be trans 
spliced into the target RNA (herein termed the "trans- 
Z ced sequence", will reflect the purpose of the 
SL-spli=ing. If a trans- splicing event is desired 
that does not result in the expression cf a new 
genetic sequence, then the trans-spliced sequence need 
not encode a translatable protein sequence. If a 
trans-splicing event is desired that does result in 
the expression cf a new genetic sequence, and 
..Peci^y > new peptide or protein sequence, then the 
' "L-spliced sequence may further provide 
Taxational stop codons. and other information 
necessary for the correct trensl.tiona! proces sing of 
the RNA in the host cell. If • specific protein 
product is desired as a resu!t of the tr««Pl»e»» 
event, then it would be necessary to maintain the 
amino acid reading frame in the resulting fusion. 

The identification and confirmation of the 
soeoificity of a ribozyme of the invention is made by 
testing Native ribozyme's ability to catalyze the 
desired trL-splicin, reaction in the presence of the 
desired target sequence. The trans-splicing reaction 
should not occur if the only RNA sequences present are 
non-target sequences to which such ribozyme should not 
Z responsive (or less responsive, . Such 
characterization may be performed with the assistance 
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of a marker such that correct (or incorrect) ribozyme 
activity may be more easily monitored. In most cases, 
it is sufficient to test the ribozyme against its 
intended target in vitro and then transform a host 
cell with it for study of its in vivo effects. 

When it is desired to eliminate a host's RNA, 
such elimination should be as complete as possible. 
When it is desired to provide a new genetic sequence 
to a host cell, the trans-splicing reaction of the 
invention need not be complete. It is an advantage of 
the invention that, depending upon the biological 
activity of the peptide that is translated from such 
genetic sequence, the trans-splicing event may in fact 
be quite inefficient, as long as sufficient trans- 
splicing occurs to provide sufficient mRNA and thus 
encoded polypeptide to the host for the desired 
purpose . 

Transcription of the ribozyme of the invention in 
a host cell occurs after introduction of the ribozyme 
gene into the host cell. If the stable retention of 
the ribozyme by the host cell is not desired, such 
ribozyme may be chemically or enzymatically 
synthesized and provided to the host cell by 
mechanical methods, such as microinjection, liposome- 
mediated transf ection, electroporation, or calcium 
phosphate precipitation. Alternatively, when stable 
retention of the gene encoding the ribozyme is 
desired, such retention may be achieved by stably 
inserting at least one DNA copy of the ribozyme into 
the host's chromosome, or by providing a DNA copy of 
the ribozyme on a plasmid that is stably retained by 

the host cell. 

preferably the ribozyme of the invention is 
inserted into the host's chromosome as part of an 
expression cassette, such cassette providing 
transcriptional regulatory elements that will control 
the transcription of the ribozyme in the host cell. 
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Such elements may include, but not necessarily be 
limited to, a promoter element, an enhancer or UAS 
element, and a transcriptional terminator signal. 
Polyadenylation is not necessary as the ribozyme is 
not translated. However, such polyadenylation signals 
may be provided in connection with the sequence 
encoding the element to be trans-spliced. 

Expression of a ribozyme whose coding sequence 
has been stably inserted into a host's chromosome is 
controlled by the promoter sequence that is operably 
linked to the ribozyme coding sequences. The promoter 
that directs expression of the ribozyme may be any 
promoter functional in the host cell, prokaryotic 
promoters being desired in prokaryotic cells and 
eukaryotic promoters in eukaryotic cells. A promoter 
is composed of discrete modules that direct the 
transcriptional activation and/or repression of the 
promoter in the host cell. Such modules may be mixed 
and matched in the ribozyme' s promoter so as to 
provide for the proper expression of the ribozyme m 
the host. A eukaryotic promoter may be any promoter 
functional in eukaryotic cells, and especially may be 
any of an RNA polymerase I, II or III specificity. If 
it is desired to express the ribozyme in a wide 
variety of eukaryotic host cells, a promoter 
functional in most eukaryotic host cells should be 
selected, such as a rRNA or a tRNA promoter, or the 
promoter for a widely expressed mRNA such as the 
promoter for an actin gene, or a glycolytic gene. If 
it is desired to express the ribozyme only m a 
certain cell or tissue type, a cell-specific (or 
tissue-specific) promoter elements functional only in 
that cell or tissue type should be selected. 

The trans-splicing reaction is chemically the 
same whether it is performed in vitro or in vivo. 
However, in vivo, since cofactors are usually already 
present in the host cell, the presence of the target 
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and the ribozyme will suffice to result in trans- 
splicing. 

The embodiments described above would also apply 
to the construction of pro-r ibozymes . A trans- 

5 splicing ribozyme, as described above, consists of 

three fused sequence elements - a 5' "anti-sense" 
region which is complementary to the target RNA, the 
catalytic region which is based on a self-splicing 
Group I intron, and 3' "exon" sequences. The 5' 

10 region can base pair with the chosen target RNA, to 

bring it into proximity with the catalytic sequences 
of the Group I intron. The structure of the Group I 
intron provides a chemical environment suitable to 
catalyze the precise splicing of the target RNA with 

15 the 3' "exon" sequences. However, in the absence of 

the appropriate target RNA, the ribozyme sequences can 
still catalyze scission at the 3' "exon" junction 
(similar hydrolysis is seen for Group I self-splicing 
intons (Zaug et al., Science 231:470-475 (1986)), and 

20 way be able to catalyze illegitimate splicing events 

through transient base-pairing of the ribozyme with 
heterologous RNA sequences (which may include their 
own). Such side-reactions and illegitimate splicing 
events are unwanted, and may be deleterious. For 

25 example, if trans-splicing is to be used for 

conditional delivery of a toxin in vivo, illegitimate 
trans-splicing might result in unexpected expression 
of the toxic activity. Spontaneous cleavage at the 3' 
"exon" junction would lower the efficiency of trans' 

30 splicing. 

To help avoid these problems, "pro-ribozyme" 
forms of the trans-splicing RNAs have been constructed 
wherein a conserved helix (for example, helix P8) is 
disrupted. The pro-ribozymes are constructed to 

35 contain extra self-complementary sequences which cause 

the catalytic center of the ribozyme to be mis-folded. 
The pro-ribozymes are inactive in the absence of the 
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intended target RNA; active forms are only forced 
after base-pairing of the ribozyme and target RNAs 
with consequent displacement of the interfering 
secondary structure within the ribozyme Pro- 
ribozymes are intended to be catalytically inert 
species in the absence of the target RNA, to eliminate 
unwanted self -cleavage, self-splicing and illegitimate 
trans-splicing reactions in vitro and in vivo (Figure 

?) * The pro-ribozymes described here are 
conformational^ disrupted and therefore inactive 
forms of the trans-splicing activities. Thus the pro- 
ribozymes possess little self-cleavage activity. They 
are only re-activated by specific interaction with the 
target SNA, and thus are substrate-activated ribozymes 
which are less likely to catalyze trans-splicing to an 
unintended target RNA. Trans-splicing ribozymes are 
intended to be used for the delivery of new gene 
activities in vivo, and any reduction in the extent of 
unwanted side reactions or illegitimate splicing is 
desirable, and may be necessary. 

While the disruption of helix P8 has been 
exemplified here for the trans-splicing P ro-ribozymes, 
other helices which, are required for catalytic 
activity could also have been used. 

The same approach, of disrupting the conformation 
of a catalytically important structure in such a way 
that only base-pairing with the intended substrate RNA 
will allow the formation of an active ribozyme, could 
be applied to other ribozyme designs. For example, 
the loop sequence of a "hammerhead" type 
endoribonuclease (Haseloff et al., Nature 334: 585-591 
(1988)) could be extended and made complementary to 
one of the "anti-sense" arms of the ribozyme - similar 
35 to the above modification of helix P8. 

Endoribonuclease activity would only be exhibited 
after base-pairing with the chosen target RNA, 



20 
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displacement of the disrupting secondary structure, 
and reformation of the stem-loop structure required 
for catalysis. This would effectively increase the 
specificity of the ribozyme of its target. 

in addition, the activation of a pro-ribozyme 
need not rely on base-pairing with the substrate 
itself, instead, a chosen third RNA or ssDNA or even 
protein might be required for activity. An additional 
base-pairing or RNA-protein interaction would be 
required for the formation of an active ribozyme 
complex. The availability of such additional 
components would determine ribozyme activity, and 
could be used to alter ribozyme selectivity. 

The trans-splicing ribozymes, pro-ribozymes, and 
methods of the invention are useful in producing a 
gene activity useful for the genetic modification of 
targeted cells. For example, the trans-splicing 
reaction of the invention is useful to introduce a 
protein with toxic properties into a desired cell. 
The susceptibility of cells will be determined by the 
choice of the target SNA and the regulatory controls 
that dictate expression of the ribozyme. For example, 
a ribozyme or pro-ribozyme that transposes an RNA 
sequence encoding a toxic protein may be engineered so 
that expression of the ribozyme or pro-ribozyme will 
depend upon the characteristics of an operably-linked 
promoter. In a highly preferred embodiment, diphtheria 
toxin peptide A is encoded by that part of the 
ribozyme that is trans-spliced into a desired target 
in the host. Conditional expression of the ribozyme 
and diphtheria toxin peptide A chain results in the 
death of the host cell. Other potentially useful 
peptide toxins include ricin, exotoxin A, and herpes 
thymidine kinase (Evans, G.A. , Genes fi Dev. 3:259-263 
(1989)). In addition, various lytic enzymes have the 
potential for disrupting cellular metabolism. For 
example, a fungal ribonuclease may be used to cause 
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M l. sterility in plants (Mariani, C et al., 
347-737-741 (1990)). Particular tissue. might be 
^royed due to Umited expression 

rurther, « a viral » peered. 
s of viru» resistance, or therapies » j 

The ribozyme or pro-ribozym. of the invention may 

h. introduced into any host cell. P 1 *^ 0 " 0 " 

. , , » niant or mammalian 

euXaryotic and especially into a plant or 

host cell. «nd especially a human cell, either in 

culture or in vivo, using techniques Known in the art 

appropriate to such hosts. The ribozymes or pro- 

rlLzyme of the invention may also be engineered to 

viruses. In one embodiment, the ribozyme o 

pr o-ribozyme of the invention is provided in a 

oenetically stable manner to a host cell prior to a 

: ::: ****** » «- ^T^ru 

egression of the latent virus in such host cell, 
7esu in, in the appearance of the ribozyme-s or pro- 
'IbUvme target SNA in the host ceil) , would stimulate 
tneTalytic activity of the ribozyme and destruction 
of the viral FX* target and/or production of a toxin 
via trans-splicing resulting in death of the virus- 
Infected cell, in another embodiment, the »>••*»•« 
Pro-ribozyme may be engineered and packaged into the 
virus itsTlf. Such embodiments would be especially 
I erul Tn the design of viruses for investigative 
purposes, wherein the ribozyme or pro-r ibozyme may b 
designed to destroy the function of a specific viral 
Z and thus ailow the study of viral function in the 
"eofsuchRHA. viruses carrying ribozymes or 
pr "Iibozyme may also be used as carriers to transfect 
host cm with a desired ribozyme or pro-ribozyme 

" CtiV MaIe or female sterility may be engineered in 
agronomical important species using the ribozymes or 
pro-ribozymes of the invention. For example, male 
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sterility in tobacco nay be engineered by targetting 
TA29 or TA13 mRNA (iobacco anther-speeif ic genes; 
Seurinck, J. et al., Bud. Acids Res. 18:3403 (1990) 
with a ribozyme or pro-ribozyme of the invention that 
5 trans-splices the DTA 3' exon into those targets. 

The form of crop plants may be manipulated by 
selective destruction or modification of tissues using 
the ribozymes or pro-ribozymes of the invention. For 
example, seedless fruits may be made by targetting the 
10 seed storage protein mRNA with a ribozyme or pro- 

ribozyme of the invention that trans-splices the DTA 
3' exon into the target. 

Transgenic plants may be protected against 
infection by expression of virus-specific ribozymes or 
15 pro-ribozymes to kill infected cells. This would be an 

artificial form the "hypersensitive response." For 
example, cucumber mosaic virus coat protein mRNA may 
be targeted with a ribozyme or pro-ribozyme of the 
invention that trans-splices the DTA 3' exon into the 

20 target. 

populations of micro-organisms may be made 
resistant to specific pathogens by introduction of 
trans-splicing ribozymes or pro-ribozymes. For 
example, cheese-making bacteria may be made resistant 

25 to phage infection by targetting the phage RNA with a 

bacterial toxin gene or lytic enzyme encoded by the 3' 
exon provided by the ribozyme or pro-ribozyme of the 
invention, for example, which would interfere with 
phage replication by causing premature lysis after 

30 phage infection. 

Virus pathogens could be constructed to deliver 
toxic activities via trans-splicing. In this way, 
specific cell types could be targeted for ablation, 
such as for cancer or viral therapy. For example, HIV 

35 mRNA may be targeted by a ribozyme or pro-ribozyme of 

the invention that carries the DTA 3' exon, for either 
virus or liposome delivery. 
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The examples below are for illustrative purposes 
only and are not deemed to limit the scope of the 
invention. 



EXAMPLES 



. fyaffiPle 1 

P.ncfrnntio- ™* CharactPri 7,atjpn of a CAT-LflCZ 
r i rang-SPl i"j"q Pibozvme 



■hhe Invention 



Following the guidelines outlined above, a trans- 
splicing fusion ribozyme was designed that will splice 

20 a portion of the amino-terminal coding sequence of E. 

coli 0-galactosidase (LacZ) mRNA to a site in the 
chloramphenicol acetyl transferase (CAT) mRNA (Figure 
3) The sections of new sequence flanking the T. 
thermophila ribozyme core and the 3' exon were 

25 synthesized as oligonucleotides. The intact ribozyme 

sequence was then assembled by successive polymerase 
chain reactions, using the synthetic adaptor 
oligonucleotides as primers with ribozyme and 0- 
galactosidase DNA templates (while there are other 

30 methods available, this method is most convenient) . 

For the construction of a ribozyme capable of 
splicing /S-galactosidase (LacZ) a-peptide coding 
sequence to a site in the 5' coding sequence of the 
chloramphenicol acetyl transferase (CAT) , three 
35 oligonucleotides were synthesized. 

°i i ?rnrJ 1 l° G t CTT e CTTTA CGATG CCATT- GGGAT ATATC AACGG 
TGG?A £aa\ CCGT^ 'SlT TAAAA GTTAT CAGGC ATGCA CC-3 ' 
40 [SEQ ID NO. 2] 

Oligonucleotide 2 
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5'-GATTA GTTTT GGAGT ACTCG TACGG ATTCA CGGCC GTCGT 
TTTAC AA-3' [SEQ ID NO. 3] 

Olioonucleotide 3 

5'-GGCCG AATTC TTACA ATTTC CATTC AGGCT GCGCA ACTGT 
TGG-3 ' [SEQ ID KO. 4] 

Oligonucleotides 2 and 3 (200 pinoles each) were 
combined with 0.1 nq PvuII-cut pGEM4 DNA (which 
contained the LacZ o-peptide sequence) , and subjected 
to PCR amplification in a volume of 100 nl containing: 

50 mM KCl, 

10 mM Tris-HCl pH 8.3, 

1.5 mM MgCl 2 , 

0.4 mM dNTPs , 

0.1% gelatin, and 

5 U Tagl DNA polymerase, 

and incubated for 30 cycles, 1 min § 94*C, 2 mins § 
50°C, 2 mins 6 72 e C. 

Plasmid pGEM4 is commercially available from 
Promega Corporation, Madison WI, USA. 

The amplified product of 210 base-pairs was 
purified using low-gelling temperature agarose 
electrophoresis, and was used as primer in a second 
round of PCR amplification. 

Following the second round of PCR amplification, 
2.0 ng of 210 base-pair amplified product, 200 pmoles 
oligonucleotide 1 and 0.1 ng 450 base-pair fragment 
containing the T. thermophila IVS were mixed and 
subjected to PCR amplification using the conditions 
shown above. The resulting 660 base-pair product was 
digested with the restriction endonucleases EedRl and 
HindlU, and cloned into the plasmid vector pGEM4 . 
The complete sequence of the CAT-LacZ o-peptide 
ribozyme DNA sequence is presented as SEQ ID NO. 5 and 
Figure 3B. 

The cloning vector containing the cloned 
sequences was transformed into, and propagated in, the 
bacterial host XLl/Blue (Strategene, La Jolla, 
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California), using techniques known in the art 
(Maniatis, Molecular Cloning, A Laboratory Gu^e, 2nd 
SSL, l.W. Cold Spring Harbor Laboratory, 
^Ushers). However, any bacterial host capable of 
stably »aintaining the vector »ay be used, for example 

^ Tplas^id .ay be extracted fro, the host cell 
for further analysis using techniques commonly known 
i the art (Maniatis, Molecular Cloning, *«T 
Guide, 2nd edition, 1989, Cold Spring Harbor 
Laboratory, Publishers) . 

n . tii "i-rr " " f r1 "" f " 1 "* 

Using standard procedures, cloned sequences were 
purified from the bacterial host and the plasaud 
Unearned using a restriction endonuclease that does 
not cut the ribozyme sequence (for example, *coRI), 
and transcribed using T7 RNA polymerase in a volume of 
100 pi, containing: 

5 M g linearized plasmid DNA, 
40 mM Tris-HC pH 7.5, 

6 mM MgCl 2/ 

2 mM spermidine, 
10mM NaCl, 

10mM DTT r . . . r a .32 p]UXP if 

Hb«luS WO* transcripS were desired", 

100 U RNasin, and 

50 U T7 RNA polymerase, 

and the reaction was incubated at 37 -C for 2 hours. 

UNA transcripts were purified by 5% 
polyacrylamide gel electrophoresis before use (TBE 7M 
urea gel) . R»» containing active T. tfcermopM 2 IV 
sequences undergo some spontaneous scission at the 3 
intron-exon junction during transcription. Fragments 
removed by electrophoretic purification for 
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clarity of analysis during subsequent trans -splicing 
assays. 

HI. Jp vitro rrfps— t>liH "° Reaction Conditions 

5 

Target and/or trans-splicing ribozymes are 
incubated under the following conditions: 
0.1-0.5 tig RNA component (amount depends on type of 
experiment, usually ribozyme in 5-fold excess of 

10 target) , 

30 mM Tris-HCl pH 7.5, 
100 mM NaCl, 

2mM GTP, 

5 mM MgCl-, ^ 
15 in a volume of 5 nl at 42 e C, 60 mins. 

The reaction is diluted with 95 til 0-1 *M 
Na 2 EDTA, 200 mM NaCl, and ethanol precipated. The 
20 RNAs are then analysed on 5% polyacrylamide gels 

containing TBE buffer, 7M urea and 25% formamide, and 
autoradiographed . 



25 



30 



35 
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IV. frggav of E nrinnucleolvtic Activity 

After base-pairing of the ribozyme and target, 
the first step in trans-splicing is the guanosine 
mediated cleavage of the target RNA at the intended 5 ' 
splice site. Annealing and trans-splicing may be 
performed in a buffer such as 30 mM Tris-HCl, pH 7.5, 
100 mM NaCl, 5 mM MgCl 2 , 2 mM GTP at 42°C. As the 3' 
splice site is dispensable for this reaction, 
truncated trans-splicing ribozymes should behave as 
highly-specific endoribonucleases. To test this 
activity, shortened in vitro transcripts of the CAT- 
LacZ o-peptide trans-splicing ribozyme described above 
(SEQ ID NO. 5 and Figure 3) were incubated with CAT 
mRNA sequences. The CAT-LacZ ribozyme cassette is on 
a Hindlll-ScoRI fragment. The Seal cleavage site 
marks a position 5 bases upstream of the 3' splice 
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site. The ribozyme specifically cleaved the target 
m at the expected .ingle sit. to produce the 
expected size fragments. 

. v. nr f "■»— nlrln. "-ction 

TO confirm the ability ef the CAT-L.C2 a-peptide 
ribozyme to catalyze the ligation of 3' exon 

with radiolabeled CAT ». Bibozyme tr.nscripts .were 
synthesized from DNA templates w hl =h had been 3 
treated at one of several positions, ranging from 
Z end of the ribozyme core through the exo 
sequence. Incubation with labelled CAT led to the 
formation of the expected spliced products whxch 
differed in length depending on the extent of 3' exon 

MqU T.ddition. a certain proportion of the =AT-Lac 2 
a-peptide ribozyme molecules underwent spontaneous 
cleavage at the 3' splice site during m «t«o 
transcription, similar to the intact 
intron. These cleaved forms, terminated at the 
ouanosin. residue adjacent the 3' splice site, »ere 
Lso incubated with CAT In this «- 

ribozyme itself is ligated to a 3' portion of the CAT 
RNA, Z produce a product of about 55 0 £ 
size This reaction is similar to the self 
circulation of the intact intron, and the same 
ligation product is found in the other trans-splicing 
3 q reactions. 

vi. fl-mr^y of tTir rr^ c -* plicinq 

The products from a CAT-LacZ a-peptide trans- 
splicing reaction were reverse-transcribed, and 

s » poi*-™- reaction us n in e g it r e : 

oligonucleotides complementary to sequences on either 
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side of the predicted splice sites. Amplified 
sequences were cloned and sequenced. Individual 
recombinants showed no variation from the expected 
sequence of the spliced products. As found in studies 
5 with the intact intron, splicing appears to be highly 

accurate. 

Accordingly, the studies above show that a trans- 
splicing ribozyme designed according to the guidelines 
of the invention is capable of accurate, effective 
10 trans-splicing in vitro. 

fvainple 2 

P » ^n nf a ^-^-ppHeinn PihnTvm*» tftqt prpvjdes 
plant V3 j^"s Resistance 
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Cucumber mosaic virus (CMV) is a pandemic virus 
with a large number of known strains. Nine sequence 
strains are shown in the region of the start of their 
coat protein cistron encoded in RNA 3 and the 
subgenomic mRNA 4 (SEQ ID NOS. 7-25; Figures 4(A) and 
5) two sites have been chosen which are conserved in 
sequence and downstream from the AUG start codon of 
the coat protein. Oligonucleotides for the 

construction of ribozymes capable of trans-splicing 
the ile-mutant form of DTA into the CMV coat protein 
mRNA are shown in Figure 4B and discussed below. 

The trans-splicing ribozymes shown in Figure 4C 
and D are targetted to the CMV virus sequences shown 
in Figure 4B and will result not only in the cleavage 
of the CMV RNA molecules but in the expression of 
diphtheria toxin A-chain in the infected cell. The 
trans-splicing cassettes shown in Figure 4 may be 
transformed into any CMV-susceptible plant species 
using techniques known in the art, and transgenic pro- 
geny challenged by CMV infection. The design of the 
ribozyme is such that virus infection is necessary to 
initiate toxin production via RNA trans-splicing 
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35 



^.use the ^^tirl^Tr^ 
Xocalized death of ~ < umit repUMtion 

r ^rrz £ ; - — - 

artificial hypersensitive response. 

TOTwr'"' 2 

rnrnit — ° f "" r ''" t r " nl "' gt PI& 

The B.ior criteria for successful design of a 
ribozyae that trans-splices a sequence encoding a 
toxic product are not only the efficient and precise 
catalysis of trans-splicing, hut also that expression 
of tie toxic gene does not occur in the ahsence of 

^nTrlbozyme molecules can undergo spontaneous 
scission at the 3' splice site. Given the extreme 
toxicity of DTA, it is important that any 
exon sequences not dive rise to toxic 

°Lts Th . 3' «xon of DTA contained an in-frame 
products. The conceivably 
methionine at position 13, wmcn 

aive rise to . truncated but toxic polypeptide. To 
luxate this possibility, the wild-type seouen e 
m^MA. ) [SEQ ID NO. 6 (DKA) and SEQ ID No. 38 
orctelnn was altered fro- methionine at this 
(protein)] , R2 -dta., I [SEQ ID No. 39) or 

position to isoleucine (Rz ^ lie' l » u „„,„t. 
leucine (Rz-DTA leu ) [HQ ID No. 40, m «o separat 
ribo*y»e constructions (Figure .«> . Transformation of 
nlst «11. ««h DTA ile or DTA leu resulted m no 
parent truncated, toxic peptide in the host cell. 

construction of pre-ribosy.es for txans-spliei., 

The design and construction of catalytic RNAs 
< ri bozy»e, that can specifically cleave a targeted 
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RNA, and splice a chosen RNA segment to the 5' portion 
is described above. Now novel means of constructing 
trans-splicing ribozymes which are conformational^ 
disrupted, and therefore inactive, in the absence of 
the target RNA, are presented. 



Design 



As a test for the design of pro-ribozymes, the 
CAT-LacZ trans-splicing ribozyme which described 
earlier was modified (Figure 7). Phylogenetic com- 
parisons and mutational analysis (for review, see 
cech, Ann Sev. Biochem. 59:543-568 (1990)) have 
indicated that a core region of the group I self- 
splicing introns is highly conserved and important for 
activity (Figure 8). For the construction of trans- 
splicing pro-ribozymes a helix immediately adjacent to 
this region, F8, was disrupted. In the first experi- 
ments, 13 or 18 nucleotides of new sequence were 
introduced into the 5' strand and loop of helix P8, to 
produce pro-ribozyme 1 and 2, respectively. The extra 
nucleotides were complementary to the 5' "anti-sense" 
portion of the ribozyme, while the flanking sequences 
were adjusted to conserve (1) the actual sequences at 
the base of P8, and (2) the extent of base-pairing 
possible within P8 (Figure B) . The extent of self- 
complementarity between the sequences inserted into 
helix P8 and the 5' "anti-sense- region of the pro- 
ribozyme is such that this new helix would be expected 
to form in nascent transcripts, in preference to helix 
P8 The formation of this alternative helix would 
also be expected to disrupt flanking secondary and 
perhaps tertiary interactions within the catalytic 
core of the ribozyme. Thus, mis-folding of the pro- 
ribozyme would render it catalytically inactive 
(Figure 9) . However, base-pairing of the pro-ribozyme 
with the intended target RNA would displace the P8- 
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..nti-sense- base-pairing, sequester the -anti-sense 
sequences and allow re-formation of the PS helix and 
an active catalytic domain. Displacement of the PS- 
■anti-sense" helix results in a greater sum of base- 
peirs and allows proper folding of the catalytic 
domain, so should be energetically favored. 



CXT-LacZ pro-ribosyiwa 



Cloned sequences corresponding to the two CAT- 
LacZ pro-ribozymes were constructed using PCR- 
*utagenesis as described above, and RNAs were produced 
by in vitro transcription. The CAT-LacZ trans- 
splicing ribozyme was observed to undergo scission 
L5 during transcription at the 3< splice junction, as a 

result of hydrolysis catalyzed by the intron 
sequences. Similar hydrolysis is seen in in vatro 
transcripts of the unmodified retrahyjnena thermophala 
intron. In contrast, transcripts of the different 
CAT-LacZ pro-ribozymes are more stable, with little 
cleavage evident under the same conditions (Figure 
10 ) This indicates that the pro-ribozymes are 
inactive, which would be expected if the catalytic 
sequences were mis-folded. Truncated forms of the 
pro-ribozymes were tested for specific 
endoribonuclease activity directed against the CAT 
RNA CAT-LacZ pro-ribozyme RNAs were transcribed from 
templates truncated at the Seal site, to remove the 3< 
splice junction and LacZ sequences. Both ribozyme and 
30 pro-ribozyme RNAs are stable after removal of the 3' 

splice site. incubation of the truncated pro- 
ribozymes with CAT RNA led to specific cleavage of the 
target RNA to give fragments of the expected sizes 
(Figure 11). Specific cleavage activity was seen at 

35 37, 45 and 50 degrees. 

Pro-ribozyme forms of the GAL4-DTA trans-splicmg 
ribozyme were also constructed (Figure 12) . Regions 
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of 20 nucleotides (complementary to the « anti-sense- 
region) were inserted into the 5' strand and loop of 
helix P8. The two pro-ribozymes differed in the 
extent of base-pairing possible in the modified 
helices P8, and GAL4-DTA pro-ribozyme 1 possessing 
both a longer stem and fewer (3) accessible bases in 
the loop. The helix P8 of GAL4-DTA pro-ribozyme 2 
more closely resembles that of the CAT-LacZ pro- 
ribozyme 2, with a larger loop (14 bases) containing 
sequences complementary to the "anti-sense" region. 
Transcripts of the GAL4-DTA pro-ribozymes are more 
stable than those of the unmodified ribozyme. In 
particular, pro-ribozyme 2 is mainly intact after 
incubation in conditions that result in essentially 
complete self -cleavage of the ribozyme form (3 0' « 
50»C, 10 mM Mgci 2 , 2 mM GTP, see Figure 13). 

Having now fully described the invention, it will 
be understood by those with skill in the art that the 
scope may be performed within a wide and equivalent 
range of conditions, parameters and the like, without 
affecting the spirit or scope of the invention or any 
embodiment thereof . 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Haseloff, James 
lx ' Goodman, Howard M. 

(ii) TITLE OF INVENTION: Trans-Splicing Ribozymes 
(iii) NUMBER OF SEQUENCES: 51 

(1V) ^ffflil:^; Kessler, Goldstein I Fox 

\b\ STREET: 1225 Connecticut Avenue, N.W., Suite 300 



(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20036 

(v) COMPUTER READABLE FORM: 

K 1 (A) MEDIUM TYPE: Floppy disk 

rB^ COMPUTER: IBM PC compatible 

fC OPERATING SYSTEM: PC-DOS/MS-DOS 

(d{ SOFTWARE: Patentln Release #1.0, Version #1.25 

<vi) CURRENT APPLICATION DATA: 
( ' (A) APPLICATION NUMBER: PCT 

(B) FILING DATE: herewith 

(C) CLASSIFICATION: 

(VU) ^?A?^APPLICATION^NUMBER : US 07/642,333 
[b] FILING DATE: 17-JAN-1991 

(Viii) ATTORNEY /AGENT INFORMATION: 

1 (A) NAME: Goldstein, Jorge A 

(B REGISTRATION NUMBER: 29,021 

(C) REFERENCE / DOCKET NUMBER: 0609.3476604 

fix) TELECOMMUNICATION INFORMATION: 
1 ' (A) TELEPHONE: (202]833 ; 7533 
B TELEFAX: (202)B33-8716 



(2) INFORMATION FOR SEQ ID NO :1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

( B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQIDN0:1: 
TGACGCAATT CAACCAAGCG CGGGTAAACG GCGG6AGTAA CTATGACTCT 
CTAAATAGCA ATATTTACCT . TTGGAGGGAA AAGTTATCAG GCAT6CACCT 
CCTAGCTAGT CTTTAAACCA ATAGATTGCA TCGGTTTAAA AGGCAAGACC 



50 
100 
150 
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400 
450 



GTCAAATTGC GGGAAAGGGG TCAACAGCCG TTCAGTACCA AGTCTCAGGG 200 

GAAACTTTGA CATGGCCTTG CAAAGGGTAT GGTAATAAGC TGACGGACAT 250 

GGTCCTAACC ACGCAGCCAA GTCCTAAGTC AACAGATCTT CTGTTGATAT 300 

GGATGCAGTT CACAGACTAA ATGTCGGTCG GGGAAGATGT ATTCTTCTCA 350 
TAAGATATAG TCGGACCTCT CCTTAATGGG AGGTAGCGGA TGAATGGATG 
CAACACTGGA GCCGCTGGGA ACTAATTTGT ATGCGAAAGT ATATTGATTA 

GTTTTGGAGT ACTCGTAAGG TAGCCAAATG CCTCGTCATC TAATTAGTGA 500 

CGCGCATGAA TGGATTA 517 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 bases 

(B) TYPE: Nucleic Acid 

(C) STRANOEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2; 

GGCCAAGCTT CTTTACGATG CCATTGGGAT ATATCAACCG TGGTATAAAC 50 
CCGTGGTTTT TAAAAGTTAT CAGGCATGCA CC 82 

(2) INFORMATION FOR SEQ IQ N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GATTAGTTTT GGAGTACTCG TACGGATTCA CGGCCGTCGT TTTACAA 47 
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(2) INFORMATION FOR SEQ ID N0:4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 bases 

( B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

( xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GGCCGAATTC TTACAATTTC CATTCAGGCT GCGCAACTGT TGG 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B ) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Lin ear 

(ii ) MOLECULAR TYPE: Nucleic Acid 

( xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
GGGAGACCGG AAGCTTCTTT ACGATGCCAT TGGGATATAT GAACGGTGGT 
ATAAAGCCGT GGTTTTTAAA AGTTATCA6G CATGCACCTG GTAGCTAGTC 
TTTAAACCAA TAGATTGCAT CGGTTTAAAA GGCAAGACCG TCAAATTGCG 
GGAAAGGGGT CAACAGCCGT TCAGTACCAA GTCTCAGGGG AAACTTTGAG 
ATGGCCTTGC AAAGGGTATG GTAATAAGCT GACGGACATG GTCCTAACCA 
CGCAGCCAAG TCCTAAGTCA ACAGATCTTC TGTTGATATG GATGCAGTTC 
ACAGACTAAA TGTCGGTCGG GGAAGATGTA TTCTTCTCAT AAGATATAGT 
CGGACCTCTC CTTAATGGGA GCTAGCGGAT GAAGTGATGC AACACTGGAG 
CCGCTGGGAA CTAATTTGTA TGCGAAAGTA TATTGATTAG TTTTGGAGTA 
CTCGTACGGA TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC 
CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTC6CCAG 
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CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC 
GCAGCCTGAA TGGAAATTGT AAG 

(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6; 

TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGATG GAAAACTTTT 

CTTCGTACCA CGGGACTAAA 

(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 bases 

(B) T ype: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 

GTTTAGTTGT TCACCTGAGT CGTGTGTTTT GTATTTTGCG TCTTAGTGTG 
CCTATGGACA AATCTGGATC TCCCAATGCT AGTAGAACCT CCCGGCGTCG 
TCGCCCGCGT AGAGGTTCTC GGTCCGCTTC TGGT 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 4 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
GTTTAGTTGT TCACCTGAGT CQT6TTTTCT TTGTTTT6CG TCTCAQT6TS 
CCTATGGACA AATCTQGATC TCCCAATGCT AGTAQAACCT CCCGGCGTCG 
TCGCCCGC6T AGAGGTTCTC GGTCCGCTTC TGGT 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
6TTATTGTCT ACTGACTATA TAGA6AGTGT TTGT6CTGTG TTTTCTCTTT 
TGT6TCGTAG AATTGAGTCG AGTCATGGAC AAAGAATCAA CCA6TGCTGG 
TCGTAACCGT CGACGTCGTC CGCGTCGTGG TTCCCGCTCC GCCCCCTCC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:10: 

GTTATT6TCT ACTGACTATA TAGAGAGTGT GTGTGCTGTG TTTTCTCTTT 
TGT6TCGTAG AATTGAGTCG AGTCATGQAT AAATCT6AAT CAACCAGTGC 
TG6TCGTAAC CGTCGACGTC GTCCGCGTCG TGGTTCCCGC TCCGCCTCCT 



CC 



50 
100 
134 



50 
100 
149 



50 
100 
150 
152 



03/05/2003, EAST Version: 1.03.0002 



WO 92/13090 



- 49 - 



PCT/US92/00278 



(2) INFORMATION FOR SEQ ID N0:11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 
AGAGAGTGTG TGTGCTGTGT TTTCTCTTTT GTGTCGTAGA ATTGAGTCGA 50 
GTCATGGACA AATCTGAATC AACCAGTGCT GGTCGTAACC GTCGACGTCG 100 
TCCGCGTCGT GCTTCCCGCT CCGCCCCCTC C 131 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

GTTATTGTCT ACTGATTGTA TAAAGAGTGT GTGTGTGCTG TGTTTCTCTT 50 

TTACGTCGTA GAATTGAGTC GAGTCATGGA CAAATCTGAA TCAACCAGTG 100 

CTGGTCGCAA CCGTCGACGT CGTCCGCGTC GTGGTTCCCG CTCCGCCCCC 150 
TCC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 bases 

(B) TYPE: Nucleic Acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
6TTATTQTCT ACT6ACTATA TAQAGAGTGT GTGTGTGCTG TGTTTTCTCT 
TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
6CTGGTCGTA ACCGTCGACG TCGTTTGCGT CGTGGTTCCC GCTCCGCCTC 
CTCC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 
GAGTGTGTAT GTGCTGTGTT TTCTCTTTTG TGTCGTAGAA TTGAGTCGAG 
TCATGGACAA ATCTGAATCA ACCAGTGCTG GTCGTAACCG TCGACGTCGT 
CCGCGTCGTG GTTCCCCCTC CGCCCCCTCC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

6TTATTGTCT ACTGACTATA TAGAGAGTGT GTGT6CT6TG TTTTCTCTTT 
TGTGTCGTAG AATTGAGTCG AGTCATGGAC AAATCT6AAT CAACCAGTGC 



50 
100 
150 
154 



50 
100 
130 



50 
100 



03/05/2003, EAST Version: 1.03.0002 



WO 92/13090 

- 51 - 



PCT/US92/00278 



TGGTCGTAAC CATCGACGTC GTCCGCGTCG TGGTTCCCGC TCCGCCCCCT 150 

OC 152 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 bases 

(B) TYPE: Nucleic Acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 
GGAGGGGGCG GAGCGGGAAC CACGACGCGG ACGACGTCGA CGGTTACGAC 50 
CAGCCCTGGT AGATTCAGAT TTGTCCAT 78 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

TTTGCGTCTT AGTGTGCCTA TGGACAAATC TGGATCTCCC AATGCTAGT 49 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 
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TTTGCGTCTC AGTGTGCCTA TGGACAAATC TGGATCTCCC AATGCTAGT 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEON ESS : Single 
(0) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TTTGTGTCGT AGAATTGAGT CGA6TCATGG ACAAATCTGA ATCAACCAGT 

GCTGGT 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 

TTTGTGTCGT AGAATTGAGT CGAGTCATGG ATAAATCTGA ATCAACCAGT 
GCTGGT 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 
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TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
GCTGGT 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: Nucleic Acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

TTTACGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 

GCTGGT 

(2) INFORMATION FOR SEQ ID N0:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(Xi) SEQUENCE DESCRIPTION : SEQ ID N0:23: 

TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 
GCTGGT 



(2) INFORMATION FOR SEQ ID N0:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 

TTTGTGTCGT A6AATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 50 

56 

GCTGGT 



(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TTTGTGTCGT AGAATTGAGT CGAGTCATGG ACAAATCTGA ATCAACCAGT 

GCTGGT 



(2) INFORMATION FOR SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

AATTTTTGTG TCGTAGAATT GAGTCGAGTC ATGGACAAAT CTGAATCAAC 

CAGTGCTGCA 
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(2) INFORMATION FOR SEQ ID N0:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 bases 

(B) TYPE: Nucleic Acid 

( C ) STRANDEDNESS : Single 
(0) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:27: 
GCACTGGTTG ATTCAGATTT GTCCATGACT CGACTCAATT CTACGACACA 50 

51 



(2) INFORMATION FOR SEQ ID N0:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 bases 

(B) TYPE: Nucleic Acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28: 
AATTTTGTGT CGTAGAATTG AGTCGAGTCA TGGAGAAATC TGAATCAACC 50 
AGTGCTGCA 59 



(2) INFORMATION FOR SEQ ID N0:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic Acid 

( C ) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 
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AGCATTGGTA TCATCAGGTT TGT 

(2) INFORMATION FOR SEQ ID N0:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTTGATGATG TTGTTGATTC T 

(2) INFORMATION FOR SEQ ID N0:31 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 ^ n0 acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 
(xi) SEQUENCE DESCRIPTION: SEQIDN0:31: 



MET ASP LYS PHE ASP ASP VAL VAL ASP SER 
c 10 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3° bases 
{B ) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:32: 
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30 

ATGGACAAAT TTQATGATGT TGTTGATTCT 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 bases 

(B) TYPE: Nucleic Acid 

( C ) STRANDEDNESS : Sing le 

(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 

AATTTTGTGT CGTAGAATTG AGTCGAGTCA TGGACAAATC TGAATCAACC 

AGTGCTGCA 



50 
59 



(2) INFORMATION FOR SEQ ID N0:34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bas€S 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii)' MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:34: 

17 

AGCCATCCTT GGTTCAG 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULAR TYPE: Nucleic Acid 

( xi) SEQUENCE DESCRIPTION: SEQ ID N0:35: 
GTAAGGGTGG ATGTT 

(2) INFORMATION FOR SEQ ID N0:36: 

(i) SEQUENCE CHARACTERISTICS: 

{ A) LENGTH: 1° amino acids 

(B) TYPE: Am in0 aoid 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: P«Pt«« 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:36: 



MET ASP LYS SER GLU LEU ARG VAL ASP VAL 
c 10 



(2) INFORMATION FOR SEQ ID N0:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3° bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:37: 
ATGGACAAAT CTGAATTAAG GGTGGATGTT 



(2) INFORMATION FOR SEQ ID N0:38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: " * mln ° acidS 

(B) TYPE: Amino acid 
(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Peptide 



15 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:38: 

MET GLU ASN PHE SER SER TYR HIS GLY THR LYS 
1 5 10 



(2) INFORMATION FOR SEQ ID N0:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:39: 

TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGATT GAAAACTTTT 

CTTCGTACCA CGGGACTAAA 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic Acid 

( C ) STRANDEDNESS : Sing le 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 

TCTCGATGAT GTTGTTGATT CTTCTAAATC TTTTGTGTTG GAAAACTTTT 
CTTCGTACCA CGGGACTAAA 



(2) INFORMATION FOR SEQ ID N0:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULAR TYPE: Nucleic Acid 

(Xi) SEQUENCE DESCRIPTION: SEQIDN0M1: 
ATGGA6AAAA AAATCACTGQ ATATACCACC GTTGATATAT C 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 ^ino acids 

(B) TYPE: Amino acid 
(D) TOPOLOGY : Linear 

(ii) MOLECULAR TYPE: Peptide 

(ix) SEQUENCE DESCRIPTION: SEQ ID N0:42: 

MET GLU LYS LYS ILE THR ASP SER LEU ALA VAL VAL LEU GLN ARG 
! 5 10 I 5 



ARG ASP 
17 



(2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 bases 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULAR TYPE: Nucleic Acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:43: 
ATGGAGAAAA AAATTACGGA TTCACTGGCC GTCGTTTTAC AACGTCGT6A C 



51 



(2) INFORMATION FOR SEQ ID N0:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 



60 
120 
180 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GTCGACCTTT TTAAGTCGGC AAATATCGCA TGTTTGTTCG ATAGACATCQ AGTGBCTTCA 
AAAGTTATCA GGCATGCACC TGGTAGCTAG TCTTTAAACC AATAGATT6C ATCGGTTTAA 
AAGGCAAGAC CGTCAAATTG CGGGAAAGGG GTCAACAGCC GTTCAGTACC AAGTCTCAGG 

GQAAACTTTG AGATGGCCTT GCAAA6GGTA TGGTAATAAG CTGACGGACA TGGTCCTAAC 240 

CACGCAGCCA AGTCCTAAGT CAACAGATCT TCTGTTGATA TGGATGCAGT TCACAGACTA 300 

AATGTCGGTC GGGGAAGATG TATTCTTCTC ATAAGATATA GTCGGACCTC TCCTTAAT6G 360 

GAGCTAGCGG ATGAAGTGAT GCAACACTGG AGCCGCTGGG AACTAATTTG TATGCGAAA6 420 

TATATTGATT AGTTTTGGAG TACTCGTCTC 6ATGATGTTG TTGATTCTTC TAAATCTTTT 480 

6TGATTGAAA ACTTTTCTTC GTACCACGG6 ACTAAACCTG GTTATGTAGA TTCCATTCAA 540 

AAAGGTATAC AAAAGCCAAA ATCTGGTACA CAAGGAAATT ATGACGATGA TTGGAAAGGG 600 

TTTTATAGTA CC6ACAATAA ATACGACGCT GCGGGATACT CTGTAGATAA TGAAAACCCG 660 

CTCTCTG6AA AAGCTG6AGG CGTGGTCAAA GTGACGTATC CAGGACTGAC 6AAGGTTCTC 720 

GCACTAAAAG TGGATAATGC C6AAACTATT AAGAAAGAGT TAGGTTTAAG TCTCACTGAA 780 

CC6TTGATGG AGCAAGTCGG AACGGAAGAG TTTATCAAAA GGTTCGGTGA TGGTGCTTCG 840 

CGTGTAGTGC TCAGCCTTCC CTTCGCTGAG GGGAQTTCTA GCGTTGAATA TATTAATAAC 900 

TGG6AACAGG CGAAAGCGTT AAGCGTAGAA CTTGAGATTA ATTTTGAAAC CCGTGGAAAA 960 

CGTGGCCAAG ATGCGATGTA TGA6TATATG GCTCAAGCCT GTGCAGGAAA TCGTGTCAG6 1020 
CGATCTTTGT GACTCGAG 
(2) INFORMATION FOR SEQ ID N0:45: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 



1038 
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(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:45: 
/UJGGAGAAAA AAAUCACUGG AUAUACCACC GUUGAUAUAU CCCAAUGGCA UCGUAAAGAA 60 

72 

CAUUUUGAGG CA 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0;46: 
AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 
AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 
GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 
MACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 
CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 
UCGGUCGGGG AAGAUGUAUU CUUCUCAUAA CAUAUAGUCG GACCUCUCCU UAAUGGGAGC 
UAGCGGAUGA AGUGAUGCAA CACUGGAGCC GCUGGGAACU AAUUUGUAUG CGAAAGUAUA 
UUGAUUAGUU UUGGAGUACU CGUACGGAUU CACUGGCCGU CCUGUUACAA CGUCGUGAC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

03/05/2003, EAST Version: 1.03.0002 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:47: 

AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 60 

AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 120 

GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACABCCGU UCAGUACCAA GUCUCAGGGG 1B0 

AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU 6ACG6ACAUG GUCCUAACCA 240 

CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 300 

UCGGUCGGGG AAGAUGUAUU CUUCUCAUAA CAUAUAGUCG GACCUCUCCU UAAUGGGAGC 360 

UAGCG6AUGA AGUGAUGCAA CACUGGAGCC GCUG6GAACU AAUUUGUAUG CGAAAGUAUA 420 

UUGAUUAGUU UUGGAGUACU CGUACGGAUU CACUGGCC6U CCUGUUACAA CGUCGUGAC 479 
(2) INFORMATION FOR SEQ ID N0:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:48: 
AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 
AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 
GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 
AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 240 
CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 
UCGGUCGGGA CCGUUGAUAU AUG6UUCAUA ACAUAUAGUC GGACCUCUCC UUAAUGGGAG 
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CUAGCG6AUG AAGUGAUGCA ACACUGGAGC CGCUGGGAAC UAAUUUGUAU GCGAAAGUAU 
AUUGAUUAGU UUUGGAGUAC UCGUACGGAU UCACUGGCCG UCCUGUUACA ACGUCGUGAC 



(2) INFORMATION FOR SEQ ID N0:49: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



PCT/US92/00278 

420 
480 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
AAGCUUCUUU ACGAUGCCAU UGGGAUAUAU CAACGGUGGU AUAAAGCCGU GGUUUUUAAA 
AGUUAUCAGG CAUGCACCUG GUAGCUAGUC UUUAAACCAA UAGAUUGCAU CGGUUUAAAA 
GGCAAGACCG UCAAAUUGCG GGAAAGGGGU CAACAGCCGU UCAGUACCAA GUCUCAGGGG 
AAACUUUGAG AUGGCCUUGC AAAGGGUAUG GUAAUAAGCU GACGGACAUG GUCCUAACCA 
CGCAGCCAAG UCCUAAGUCA ACAGAUCUUC UGUUGAUAUG GAUGCAGUAC AGACUAAAUG 
UCGGUCGGGA CCGUUGAUAU AUCCCAAACG GUUCAUAACA UAUAGUCGGA CCUCUCCUUA 
AUGGGAGCUA GCGGAUGAAG UGAUGCAACA CUGGAGCCGC UGGGAACUAA UUUGUAUGCG 
AAAGUAUAUU GAUUAGUUUU GGAGUACUCG UACGGAUUCA CUGGCCGUCC UGUUACAACG 
UCGUGAC 

(2) INFORMATION FOR SEQ ID N0:50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:50: 

GTCGACCTTT TTAAGTCGGC AAATATCGCA TGTTTGTTCG ATAGACATCG AGTGGCTTCA 60 

AAAGTTATCA GGCATGCACC TGGTAGCTAG TCTTTAAACC AATAGATTGC ATCGGTTTAA 120 

AA6GCAAGAC CGTCAAATTG CGGGAAAGGG GTCAACAGCC GTTCAGTACC AAGTCTCAGG 180 

G6AAACTTTG AGATGGCCTT 6CAAAGGGTA TGGTAATAAG CTGACGGACA TGGTCCTAAC 240 

CACGCAGCCA AGTCCTAAGT CAACAGATCT TCTGTTGATA T6GATGCAGT TCACAGACTA 300 

AATGTCGGTC GGGGAACAAC ATGCGATATT GTTCTCATAA GATATAGTCG GACCTCTCCT 360 

TAATGGGAGC TAGCGGAT6A AGTGATGCAA CACTGGAGCC GCTGGGAACT AATTTGTATG 420 

CGAAAGTATA TTGATTAGTT TTGGAGTACT CGTCTCGATG ATGTTGTTGA TTCTTCTAAA 480 

TCTTTTGTGA TTGAAAACTT TTCTTCGTAC CACGGGACTA AACCTGGTTA TGTAGATTCC 540 

ATTCAAAAAG GTATACAAAA GCCAAAATCT GGTACACAAG GAAATTATGA CGATGATTGG 600 

AAAGGGTTTT ATAGTACCGA CAATAAATAC GACGCTGCGG GATACTCTGT AGATAATGAA 660 

AACCCGCTCT CTGGAAAAGC TG6AGGCGTG GTCAAAGTGA CGTATCCAGG ACTGACGAAG 720 

GTTCTCGCAC TAAAAGTGGA TAATGCCGAA ACTATTAAGA AAGAGTTAGG TTTAAGTCTC 780 

ACTGAACCGT TGATGGAGCA AGTCGGAACG GAAGAGTTTA TCAAAAGGTT CGGTGATGGT 840 

GCTTC6CGTG TAGTGCTCAG CCTTCCCTTC GCTGA6GGGA GTTCTAGCGT TGAATATATT 900 

AATAACTGGG AACAGGCGAA AGCGTTAAGC GTAGAACTTG AGATTAATTT TGAAACCCGT 960 

GGAAAACGTG GCCAAGATGC GATGTATGAG TATATGGCTC AAGCCTGTGC AGGAAATCGT 1020 

GTCAGGCGA7 CTTTGTGACT C6AG 1044 
(2) INFORMATION FOR SEQ ID N0:51 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
6TCGACCTTT TTAAQTCGGC AAATATC6CA TQTTTQTTCG ATAQACATCS AQTGGCTTCA 
AAAQTTATCA 66CATGCACC TGGTAGCTAG TCTTTAAACC AATAGATTGC ATCGGTTTAA 
AABGCAAGAC CGTCAAATTG CGGGAAAGGG GTCAACAGCC GTTCAGTACC AAGTCTCAGG 
6GAAACTTTG AGATGGCCTT GCAAAGGGTA TGGTAATAAG CTGACGGACA TGGTCCTAAC 
CACGCAGCCA AGTCCTAAGT CAACAGATCT TCTGTTGATA TGGATGCAGT TCACAGACTA 
AATGTCGGTC GGGCAAACAT GCGATATTTG CCGTTTGTCA TAAGATATAG TCGGACCTCT 
CCTTAATGGG AGCTAGCGGA TGAAGTGATG CAACACTGGA GCCGCTGGGA ACTAATTTGT 
ATGCGAAAGT ATATTGATTA GTTTTGGAGT ACTCGTCTCG ATGATGTTGT TGATTCTTCT 
AAATCTTTTG TGATTGAAAA CTTTTCTTCG TACCACGGGA CTAAACCTGG TTATGTAGAT 
TCCATTCAAA AAGGTATACA AAAGCCAAAA TCTGGTACAC AAGGAAATTA TGACGATGAT 
TGGAAAGGGT TTTATA6TAC CGACAATAAA TACGACGCTG CGGGATACTC TGTAGATAAT 
BAAAACCCGC TCTCTGGAAA AGCTGGAGGC GTGGTCAAAG TGACGTATCC AGGACTGACG 720 
AAGGTTCTCG CACTAAAAGT GGATAATGCC GAAACTATTA AGAAAGAGTT AGGTTTAAGT 
CTCACTGAAC CGTTGATGGA GCAAGTCGGA ACGGAAGAGT TTATCAAMG GTTCGGTGAT 
GGTGCTTC6C GT6TAGTGCT CAGCCTTCCC TTCGCTGAGG GGAGTTCTAG CGTTGAATAT 
ATTAATAACT GGGAACAGGC GAAAGCGTTA AGCGTAGAAC TTGAGATTAA TTTTGAAACC 
CGTGGAAAAC GTGGCCAAGA TG0GAT6TAT GAGTATATGG CTCAAGCCTG TGCAGGAAAT 
CGTGTCAGGC GATCTTTGTG ACTCGAG 
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What is claimed is: 

1. a first polynucleotide molecule, said 
polynucleotide molecule comprising a sequence encoding 
a trans-splicing ribozyme, said ribozyme being capable 
of trans-splicing a second polynucleotide sequence 
into a target RNA sequence in vitro or in vivo, said 
ribozyme comprising sequences complementary to said 
target mRNA sequence. 

2. A polynucleotide molecule, said molecule 
comprising a sequence encoding a trans-splicing 
ribozyme, the sequence of said ribozyme comprising a 
fusion RNA, such fusion RNA providing (1) a first RNA 
sequence, said first RNA sequence being sufficient for 

15 targeting said ribozyme to hybridize to a target RNA, 

and (2) a second RNA sequence, said second RNA 
sequence being capable of being transferred colinearly 
into a target RNA as a result of the trans-splicing 
activity of said ribozyme. 
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3. The polynucleotide molecule of claim 2, 
wherein said second RNA sequence comprises a sequence 
that encodes a peptide toxic to the host cell. 

4. The polynucleotide molecule of claim 3, 
wherein said peptide is the DTA peptide. 

5. The polynucleotide molecule of claim 4, 
wherein said DTA peptide is a mutant peptide sequence. 

6. The polynucleotide molecule of claim 5, 
wherein said mutant peptide sequence comprises amino 
acids encoded by SEQ ID. No. 39. 

7. The polynucleotide molecule of claim 5, 
wherein said mutant peptide sequence comprises amino 
acids encoded by SEQ ID. No. 40. 
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8. The polynucleotide molecule of any one of 
claias'l-7, wherein said molecule is RNA. 

9. The polynucleotide molecule of any one of 
claims 1-7, wherein said molecule is DNA. 

10 A polynucleotide molecule comprising a 
ribozyme expression cassette, said cassette being 
capable of being stably inserted into the genome of a 
host, and said cassette comprising the sequence of a 
promoter capable of functioning in such host, 
operably-linked to the coding sequence of the 
polynucleotide of any one of claims 1-7. 

11. A host cell comprising the polynucleotide 
molecule of claim 10. 

12. The host cell of claim 11, wherein said host 
cell is a viral cell. 

13. The host cell of claim 11, wherein said host 
cell is a prokaryotic cell. 

14. The host cell of claim 11, wherein said host 
cell is a eukaryotic cell. 

15. The host cell of claim 14, wherein said 
eukaryotic cell is a plant cell. 

16. The host cell of claim 14, wherein said 
eukaryotic cell is an animal cell. 

17. The host cell of claim 16, wherein said 
animal is a mammal. 
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18. The host cell of claim 17, wherein said 
animal is human. 

19. A method for in vitro trans-splicing, such 
5 method comprising the steps of: 

(1) providing the polynucleotide molecule 
of any one of claims 1-7 in a trans-splicing reaction 
mixture, said polynucleotide comprising a sequence 
capable of hybridizing with a second polynucleotide; 
10 (2) providing said second polynucleotide to 

such reaction mixture; and 

(4) catalyzing the trans-splicing of said 
second polynucleotide under said conditions. 

15 20. A method for in vivo trans-splicing, said 

method comprising the steps of: 

(1) providing a polynucleotide of claim 8 

to a host cell; 

(2) expressing said ribozyme encoded by 
20 said molecule in said host cell; 

(3) expressing a substrate of said ribozyme 

in said host cell; and 

(4) catalyzing the trans-splicing of said 
ribozyme with such substrate in said host cell. 

25 

21. A method for inactivating the activity of a 
target RNA, said method comprising: 

(1) providing the polynucleotide of any one 
of claims 1-7 to a trans-splicing reaction mixture, 

30 said ribozyme possessing catalytic activity against a 

target RNA, said catalytic activity resulting in the 
inactivation of the functioning of said target RNA; 

(2) providing said target RNA to said 
mixture; and 

35 (3) providing conditions that allow said 

polynucleotide to express said catalytic activity. 
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22. A method for providing a desired genetic 
sequence to a host cell in vivo, said method 

comprising: 

(1) providing the polynucleotide of claim 

5 8 to said host cell, said polynucleotide possessing 

catalytic activity against a target RNA in said host 
cell, said ribozyme being capable of trans-splicing 
said desired genetic sequence; 

(2) providing said target RNA in said host 

10 cell; and 

(3) providing conditions that allow said 
ribozyme to trans-splice said desired genetic sequence 
into the sequence of said target RNA. 

23. A method for engineering male or female 
sterility in a plant, said method comprising providing 
the polynucleotide of claim 8 to a germ cell of said 
species, said ribozyme being targeted to an RNA that, 
when expressed as a protein, is necessary for the 
fertility of said plant. 
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24. A method of conferring a desired genetic 
trait in a plant, said method comprising providing a 
germ cell of said plant with the polynucleotide of 
claim 8, said ribozyme encoding ' a trans-splicing 
sequence capable of conferring said desired genetic 
trait in said plant. 

25. A method for immunizing plants against a 
plant pathogen, said method comprising the 
transforming plant cells with the polynucleotide of 
claim 8, wherein said polynucleotide encodes a trans- 
splicing sequence capable of providing immunity to 
said pathogen to said plant. 

26. The method of claim 25, wherein said 
pathogen is cucumber mosaic virus. 
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27. A method for construction of a pro-ribozyme, 
wherein said method comprises disruption of helix P8. 

28. The method of claim 27 , wherein said 
disruption is caused by P8-anti-sense base-pairing. 

29. The polynucleotide molecule of any one of 
claims 1-7 wherein said ribozyme is a pro-ribozyme. 

30. The polynucleotide molecule of any one of 
claims 8 wherein said ribozyme is a pro-ribozyme. 

31. The polynucleotide molecule of any one of 
claims 9 wherein said ribozyme is a pro-ribozyme. 

32. The polynucleotide molecule of any one of 
claims 10 wherein said ribozyme is a pro-ribozyme. 

33. The host cell of claim n # wherein said 
ribozyme is a pro-ribozyme. 

34. The method of claim 19 , wherein said 
ribozyme is a pro-ribozyme. 

35. The method of claim 20, wherein said 
ribozyme is a pro-ribozyme. 

36. The method of claim 21 , wherein said 
ribozyme is a pro-ribozyme. 

37. The method of claim 22 , wherein said 
ribozyme is a pro-ribozyme. 

38. The method of claim 23 , wherein said 
ribozyme is a pro-ribozyme. 



03/05/2003, EAST Version: 1.03.0002 



WO 92/13090 



PCT/US92/00278 



- 72 - 



39. The method of claim 24, wherein said 
ribozyme is a pro-ribozyme. 

40. The method of claim 25, wherein said 
ribozyme is a pro-ribozyme. 
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Self-splicing of Group I introns. 



1. Guanosine-mediated cleavage of 5' splice site. 



Stxon 



3' exon 



2, Cleavage of 3' splice site. 



5* exon 



3' exon 



3. Ligation of exon segments. 



5 exon 



3' exon 



4. Circularization of intron. 
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PUT. d;ritf*T 




*!»»*fty, Jwiwiry 15,'a»»i >k 



•1C *CAT-Ue2 *t»-Rt •> U«t 



CGCACACCGGAA . .1 CCAAATTCXAAS ain*tz 



1 CCCASACCCC AAGCTTCTTT ACOATGCCAT 
41 OCTT7TTWA AGTTATCAC6 CAT5CACCTC 
121 CCC77TUUU CCCAA&ACCS TCAAATTGCG 
It! CTCTCASW* AA^CTXTSAG ATOJCCTTCC 
241 CSCCIAACCA C C C AC CC AA C TCCIXXCTCX 
301 ACA6ACTAM tcjfcCCTCGG CGAAGATCTA 
941 CTTAATCttA CCJASCCCAT CAACTGATCC 
431 SCCCAAACTA DOTCATXAC T3TT«ACTA 
411 AACGWTtt CtJWGAAAAC CCTCGCCTX* 
fll.CTTTCCCCAC CTpCCCT^AT ACCGAABAfir, 
401 CCA5CCTCAA TCGAAATTC? AAS 

| i0 I 20 I 30 



I 40 



1 *0 



< 40 



ICOSAtATXT CAACCWCS* ATAAACCCCT €0 
CXACCTACIC TTSAAACCAA TACATXCCAT 120 
OfiikAAOGGGT CAACAGCCCT TC ACTACCAA 110 
JuJkCCCTAIC CXAAIAAGCT GAOCSACATC 340 
ACMAtCTTC tCTTCATATC CATCCACTTC 300 
TTCTICTCAT AACATAXACT CC &A Cg T C I C 340 
AACACTCCXC COGCTGGGAA CXAATTTC7A 420 
CTGCSACOGA TICACICCCC CTCCTTTXAC 410 
CefaaCTTAA TC6CCTTCCA CCACATCCCC M0 

cccncMtnA xcecccncc caacacttcc coo 

I ■ 423 

I 40 1 so I io 
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Cucumber iiosaic virus 



Virus ILKA 

CMV isolates. 



t 

target sequences. I 

i - ' 



4 trans-splicing ribozymes. 



WD 

Q 

Fay 
K 
I 
0 

r 
c 
c 



TOUGCGUCU. .UAGUGUGCCU J^Z£ GAC 

UUUGCGUCUCAG... . . „ .UGUGCCU il2£ GAC 
OUUGT7GUCGUAGAAUUGAGUCGAGUC £22 GAC 
UUUGUOTCOTAGAAUUGAGU CGAGUC AOfi GA0 
UUUGUGUCGyAGAAOUGAGUCGAGUC 2125 GAC 
UUUACGUCGUAGAAUUGAGUCGAGUC AP^ GAC 
PUUGUGU CGUA GAAUUGAGUCGAGUC itlfi GAC 
2™**^^ 2&S GAC 

Uw^OTCOTAGAAUUGAGUC^AGUC 2I2S GAC 



a 
I 

AAA U.CU 
AAA U.CU 
AAA U.CU 
AAA U.CU 
AAA U.CU 
AAA U.CU 
AAA U.CU 
AAA U.CU 
AAA U.CU 



Oligonucleotide target sequences. 



2 
f 

GGA U.CU CCC AAU GCU AGU- 
GGA U.CU CCC AAU GCU AGU 
GAA U.CA ACC AGU GCU GGU - 
GAA U.CA ACC AGU GCU GGU . 
GAA U.CA ACC AGU GCU GGU • ^ 
GAA U.CA ACC AGU GCU GGU — 

GAA U.CA ACC AGU GCU GGU 

GAA U.CA ACC AGU GCU GGU _ 
GAA U.CA ACC AGU GCU GGU ^ ' 

3 < 



AATTOT^GTAGAAMGAGTCGAGTC Alfi GAC AAA 7.CT GAA T.'cA ACC AGT GCT GCA ^ 
AACACAGCATCTTAACTCAGCTCAG TAC CTG Tt* I.GA iii i.ii iii i^ ^ 

1 : I r Pst i 



ZcoR I 



CMV RNA 4 - 

libozyme 1 



210 

*7 



diphtheria toxin ! A-chaio trans-splicing ribozymes. 
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